Accelerate Literature Icon
Want to do a literature review? Try our new Literature Review workflow

COMPARATIVE ANALYSIS OF THE NORMALITY OF STATISTICAL CRITERIA FOR SAMPLES OF CONTAMINATED DATA

  • TL;DR
  • Abstract
  • Literature Map
  • Similar Papers
TL;DR

This study evaluates the robustness of statistical normality tests—Student's t-test, Kolmogorov-Smirnov, and Anderson-Darling—in small contaminated samples, using numerical modeling and visual analysis, and offers practical guidelines for selecting appropriate criteria considering contamination levels and sample size.

Abstract
Translate article icon Translate Article Star icon

The paper considers the influence of contaminated samples with anomalous observations on the reliability of statistical analysis results and hypothesis testing for sample homogeneity. The main focus is on visual data analysis as an effective means of preliminary research. The use of histograms, scatter plots, and density estimates allows for the visual identification of outliers, the assessment of the distribution shape, and the detection of differences between samples. The purpose of the study is to evaluate the robustness of popular statistical criteria for testing the normality of distribution in the presence of contamination in small samples. The scientific novelty lies in the quantitative study of the impact of different types of contamination on the results of popular criteria, as well as in the practical assessment of their behavior under conditions of violation of assumptions about data homogeneity. The practical novelty lies in the development of recommendations for practitioners on the selection of the optimal criterion when analyzing samples with possible anomalous observations, taking into account the stability of statistical methods. Research methods include numerical modeling of samples with controlled introduction of structural contaminants, assessment of the frequency of false rejections/acceptance of the null hypothesis, as well as comparative analysis of the results obtained using the following statistical criteria: Student's t-test for comparing the mean values of two samples; the Kolmogorov-Smirnov one-sample test to verify the conformity of the empirical distribution with the theoretical one; the Anderson-Darling criterion to verify the normality assessment; the Kolmogorov-Smirnov two-sample test to verify the homogeneity of two distributions. The results of the study showed the importance of choosing the appropriate criterion depending on the sample size and the expected level of contamination. Presenting the average values and ranges for N repetitions of the experiment allows for a visual assessment of the stability and reliability of each test in the presence of contaminated data. Based on the experiments conducted, practical recommendations are proposed for the preliminary diagnosis of samples and the selection of the optimal approach to testing hypotheses in the presence of contaminated data.

Similar Papers
  • Book Chapter
  • Cite Count Icon 2
  • 10.1007/978-3-030-34365-1_9
A System and Model of Visual Data Analytics Related to Junior High School Students
  • Jan 1, 2019
  • Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
  • Dang Van Pham + 1 more

The assessment of students’ learning ability for career guidance in the future is a huge challenge. The development stage of students’ learning ability is considered from the sixth grade to the ninth grade. Student’s transcripts from grade 6 to grade 9 are used to assess students’ learning abilities. A transcript comparison of grades 6 through 9 is essential for each parent and analyst from there they can guide their children to comprehensive development of knowledge. The objective of this paper is to visually analyze student data using visual analysis approach, proposes a visual analysis system for data discovery with many variables (VAS), a visual data analysis model, visual data analysis criteria, visual data variables, multidimensional cube representing student data, and some visual data analysis questions based on visual graphs related to Junior High School students (JHSSs). Visual analysis of student data helps parents or analysts observe and extract useful information that they interact visual on visual graphs by asking themselves or answering the visual data analysis questions themselves when observing visual graphs by the retina to guide their children to choose the right knowledge chain and future jobs. Visual graphs represent the correlation between subjects and especially the comparison of a subject in the academic years together to help parents and analysts see clearly the trend of the development of students’ learning abilities by visual data analysis model.

  • Research Article
  • 10.21831/jpmmp.v8i2.71583
Workshop on Visual Data Analysis with R Program
  • Aug 26, 2024
  • Jurnal Pengabdian Masyarakat MIPA dan Pendidikan MIPA
  • Dhoriva Urwatul Wutsqa + 6 more

Statistics data analysis generally focuses more on mathematical procedures than visual. Visual analysis is very useful for research and this is still very limited to study at Universitas Mercu Buana Yogyakarta, so the UNY Statistics lecturer’s service activity is holding visual data analysis workshop with the R program, where this program is open source and is complete for visual analysis. The material for this activity is about procedures and uses for visual data analysis, introduction to the R program, data management with the R program, visual data analysis for group descriptions and comparisons, and visual data analysis for relationships between variables. Evaluation of participants' ability to understand the material is measured through 14 questions with four Likert Scale responses. Based on 40 questionnaires, 27,86% answered "Strongly Agree", 71,96% "Agree", and 0,18% "Disagree" regarding understanding and applying visual data analysis techniques with the R program. Therefore, it can be concluded that the majority of participants could understand the workshop material and follow the training well.

  • Research Article
  • Cite Count Icon 26
  • 10.1016/j.pmrj.2016.10.018
The Value of Scatter Plots
  • Nov 2, 2016
  • PM&R
  • Kristin L Sainani

The Value of Scatter Plots

  • Research Article
  • 10.24127/diversifikasi.v5i3.5376
Pengaruh Semangat Kerja Dan Lingkungan Kerja Terhadap Kinerja Karyawan Dengan Gaya Kepemimpinan Sebagai Variabel Moderating (Studi Kasus Pada CV. Maju Sejahtera Intimojopahit)
  • Sep 12, 2025
  • Jurnal Manajemen DIVERSIFIKASI
  • Ahmad Fatur Rahman Sampurna Jaya + 2 more

The aim of this research is to determine whether there is an influence of work morale and work environment on employee performance with leadership style as a moderating variable on CV. Maju Prosperous Inti Mojopahit. The role of employees is very important to the success or failure of the company. Problems related to performance are also a tragic issue for companies that program human resource problems. Success in performance depends on work morale, work environment, and leadership style. With less than optimal performance caused by a lack of work enthusiasm among employees and a poor work environment, as well as the implementation of a leadership style that is not wise enough, the results obtained are less than optimal. This research uses quantitative research; the sample in this research is 40 employees. The data collection technique uses a questionnaire, with data analysis using Validity Test, Reliability Test, Normality Test, Linearity Test, Homogeneity Test, MRA Interaction Test, R Determination Test, and Statistical Hypothesis Test. By using data processing program SPSS version 22, The research results show that work enthusiasm influences employee performance, the work environment influences employee performance, and leadership style as a moderation variable is able to strengthen work enthusiasm for employee performance at CV. Maju Sejahtera Inti Mojopahit and leadership style as a moderation variable are able to strengthen the work environment and employee performance at CV. Maju Prosperous Inti Mojopahit.

  • Research Article
  • 10.57053/itqan.v5i1.131
The Impact of Compensation and Career Development on Employee Performance at PT. XYZ
  • Oct 1, 2025
  • ITQAN: Journal of Islamic Economics, Management, and Finance
  • Kurnia Solehah + 2 more

This research is motivated by the emergence of problems related to compensation and career development disparities among permanent employees, contract employees, and outsourced employees. The purpose of this study is to determine the effect of compensation and career development on employee performance. This study uses a quantitative method with a stratified random sampling technique conducted at PT. XYZ conducted a study with a sample of 165 people, utilizing a data collection technique involving a questionnaire. Data analysis techniques using SPSS include instrument testing (validity and reliability tests), analyst prerequisite tests (normality and homogeneity tests), and statistical hypothesis testing (product-moment correlation tests and linear regression tests). Based on the results of the t-statistical test, there is a partial effect of compensation (X1) on employee performance (Y), as marked by the calculated t-value of 13.787, which exceeds the t-table value of 1.970. Additionally, there is a partial effect of career development (X2) on performance (Y), as indicated by the calculated t-value of 17.642, which exceeds the t-table value of 1.970. The sig. Value. F Change of 0.000 < 0.05 concludes that the compensation variables (X1) and career development (X2) simultaneously have a significant relationship with employee performance (Y).

  • Research Article
  • Cite Count Icon 12
  • 10.1097/01.hj.0000396585.52118.6b
Enough is enough: A primer on power analysis in study designs
  • Apr 1, 2011
  • The Hearing Journal
  • Chi-Chuen Lau + 1 more

Enough is enough: A primer on power analysis in study designs

  • Research Article
  • Cite Count Icon 367
  • 10.1109/tvcg.2012.110
Visualization and Visual Analysis of Multifaceted Scientific Data: A Survey
  • Mar 1, 2013
  • IEEE Transactions on Visualization and Computer Graphics
  • Johannes Kehrer + 1 more

Visualization and visual analysis play important roles in exploring, analyzing, and presenting scientific data. In many disciplines, data and model scenarios are becoming multifaceted: data are often spatiotemporal and multivariate; they stem from different data sources (multimodal data), from multiple simulation runs (multirun/ensemble data), or from multiphysics simulations of interacting phenomena (multimodel data resulting from coupled simulation models). Also, data can be of different dimensionality or structured on various types of grids that need to be related or fused in the visualization. This heterogeneity of data characteristics presents new opportunities as well as technical challenges for visualization research. Visualization and interaction techniques are thus often combined with computational analysis. In this survey, we study existing methods for visualization and interactive visual analysis of multifaceted scientific data. Based on a thorough literature review, a categorization of approaches is proposed. We cover a wide range of fields and discuss to which degree the different challenges are matched with existing solutions for visualization and visual analysis. This leads to conclusions with respect to promising research directions, for instance, to pursue new solutions for multirun and multimodel data as well as techniques that support a multitude of facets.

  • Conference Article
  • Cite Count Icon 1
  • 10.1109/rams.2019.8769007
Replication of Field Scenario in Reliability Testing
  • Jan 1, 2019
  • Narasimman Sunderajan + 1 more

Reliability testing is carried out on a product to identify design weakness or measure/demonstrate product reliability. A conventional reliability test method will follow statistical techniques to establish test hours and sample size. Reliability tests are accelerated by testing the product at higher stress levels to rapidly increase the product ageing and see failures quicker. However, the success of every reliability test is based on the effectiveness of correlating to actual field scenarios. This paper explains how to replicate the field scenario in a reliability test by combining multiple stresses and leveraging kurtosis [1] in vibration to attain higher acceleration factors, lower test times, uncover failures and estimate the product reliability.

  • Research Article
  • Cite Count Icon 3
  • 10.1007/s11042-019-08001-5
Visual analytics towards axle health of high-speed train based on large-scale scatter image
  • Aug 2, 2019
  • Multimedia Tools and Applications
  • Kunlin Zhang + 3 more

Axle is an important part of high-speed train. The axle is the key component connecting the train wheelset, which has a great impact on the train safety. The health monitoring of axles is very important for the safe and smooth operation of trains. The axle health detection is a complex process of multi-factor coupling, which faces the problems of health model construction. From the perspective of big data visual analysis, this paper helps people find the information behind the big data of high-speed railway axle monitoring, and makes a prediction and analysis of the health status of high-speed railway axle operation. Starting from the present situation of scatter plot presentation of multi-dimensional data visual analysis, this paper proposes a visual analysis and processing method for high-speed train axle health monitoring, aiming at the problems of intensive rendering, visual mutation and trend prediction when drawing large data scatter plot. Firstly, a new method of the axle data fusion model is proposed in this paper, which can effectively clean the axle health monitoring data and construct the data acquisition and expression mode of axle temperature of high-speed train. Then, visualization of axle data and prediction of axle health trend provide a new analysis model for axle health monitoring. In addition, the visual analysis method of scatter density map data can eliminate the dependence of the original complex mechanical model, and can be used to analyze different working conditions and axle types. Compared with the existing axle health monitoring methods, this method has high accuracy and practicability.

  • Research Article
  • 10.21275/sr24828180619
Post Operative Cognitive Dysfunction After Cardiac Surgeries - An Institutional Study
  • Aug 5, 2024
  • International Journal of Science and Research (IJSR)
  • Sanjeev Pandian R + 2 more

Background: Post Operative Cognitive Dysfunction (POCD) is a less known but serious complication after any major surgery especially after open cardiac surgeries. Cognitive decline after major surgeries can be transient. But the impact of POCD in the post operative recovery is troublesome which makes it a topic of interest. POCD can range from impaired memory, attention, learning, executive function to impairment of psychomotor dexterity. It can also be accompanied by behavioural changes. POCD by definition requires both pre-and postoperative psychometric tests. Though there are several studies on POCD worldwide, a reliable test has not been identified to diagnose POCD. Aim: The aim of this study is to establish the incidence of POCD and devise a reliable methodical test to diagnose POCD with available resources at Dept of Cardiothoracic Surgery, Govt Stanley medical college hospital. Settings and Design: Department of cardio thoracic surgery, retrospective study Methods and Material: Inclusion Criteria: All patients who underwent elective open cardiac surgeries in department of CTS in Stanley medical college Exclusion Criteria: 1. All patients who had previous cognitive disorders 2. Patients with psychiatric illnesses 3. Patients with preexisting cerebral dysfunction, pulmonary, renal insufficiency SAMPLE SIZE : 100 PATIENTS DURATION : Cases operated from 2020 to 2023 Statistical analysis used: SAS Results: 1.Incidence of POCD is 46% at the time of discharge, 29% at six weeks and 18% at six months after surgery 2.Montreal cognitive Assessment (MoCA) as a preoperative screening test and Wisconsin Card Sorting Test (WCST) as a post operative confirmatory test are reliable tests in our setup 3.Incidence of POCD is higher with CABG ON PUMP, followed by other open cardiac cases such as valvular replacements especially ones involving aortic valves. Conclusions: POCD is a serious complication of cardiac surgery. The incidence of POCD is the highest at the time of discharge which gradually decreases over the period of time. MoCA and WCST are useful tests for cognition where in the patients are subjected to simple methods of cognitive functional analysis.

  • Research Article
  • Cite Count Icon 9
  • 10.1080/15305058.2016.1224888
Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test
  • Oct 4, 2016
  • International Journal of Testing
  • Yi-Hsuan Lee + 1 more

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The objective was to provide bounds of the likely DIF effects on these measurement consequences. Five factors were manipulated: test length, percentage of DIF items per form, item type, sample size, and level of group ability difference. Results indicate that the greatest DIF effect was less than 2 points on the 0 to 60 total score scale and about 0.15 on the IRT ability scale. DIF had a limited effect on the ratio of true-score variance to observed-score variance, but its influence on the standard error of estimation for the IRT ability parameter was evident for certain ability values.

  • Research Article
  • 10.24127/diversifikasi.v4i1.2670
Pengaruh Kepercayaan, Kualitas Layanan, Lokasi Dan Pengaruhnya Terhadap Kepuasan Anggota BMT Assafiyah Kota Gajah Lampung Tengah
  • Mar 12, 2024
  • Jurnal Manajemen DIVERSIFIKASI
  • Ratmono + 2 more

Increasing banking competition is marked by the emergence of many financial institutions.This competition can be seen from the variety of products or services offered, one of which isBMT Assyafiiyah. banking companies or BMT Assyafiiyah must implement a good strategy sothat members are interested in saving at financial institutions. to attract member satisfactionis done by implementing the right strategy, including Trust, Quality of Service, and Location.The data analysis method used is quantitative and qualitative analysis. and the population inthis study were members of BMT Assyafiiyah, totaling 1,027 members with a sample of 91members and tested using: validity test, reliability test, normality test, linearity test,homogeneity test, multiple linear regression analysis, T-test, F test, Test the coefficient ofdetermination R2 and statistical hypothesis. Based on the results of the study, shows that theindicators used in the framework variables including Trust, Service Quality, and Locationhave a positive and significant influence on satisfaction. This shows that if BMT Assyafi'iyahKota Gajah can provide good satisfaction, it will facilitate BMT Assyafi'iyah Kota Gajah inrecruiting members. Conclusions in this study are: There is a positive and significant effect ofTrust, Service Quality, and Location on Member Satisfaction at BMT Assyafi'iyah theNational Blessing of the Kota Gajah.

  • Research Article
  • Cite Count Icon 2
  • 10.24127/diversifikasi.v3i1.1578
Pengaruh Aksesibilitas, Fasilitas Dan Daya Tarik Terhadap Kepuasan Pengunjung Taman Metro Indonesia Indah Kota Metro Lampung
  • Mar 7, 2023
  • Jurnal Manajemen DIVERSIFIKASI
  • Resi Novita Sari + 1 more

This study aimed to determine the effect of accessibility, facilities, and attractiveness on visitor satisfaction of Taman Metro Indonesia Indah, Metro City, Lampung. This research was conducted on visitors to Taman Metro Indonesia Indah, Metro City, Lampung, in 2021. This type of research used qualitative and quantitative research. Data collection was obtained by distributing questionnaires in questions about accessibility, facilities, and attractiveness to 100 respondents, then processed using the SPSS version 20 program. Instrument testing through validity and reliability tests. Testing requirements analysis through normality test, linearity test, and homogeneity test. Hypothesis testing is done through multiple linear regression models, partial t-test, and F simulative test. Then analyze the coefficient of determination (R2) and statistical hypotheses. The results showed that accessibility and facilities had a significant and insignificant effect on visitor satisfaction. While the attractiveness and significant effect on visitor satisfaction.

  • Research Article
  • 10.24127/diversifikasi.v5i3.5591
Pengaruh Orientasi Kewirausahaan Dan Strategi Bisnis Terhadap Kinerja Usaha (Studi Kasus Pada Usaha Pembenihan Ikan Nila Di Desa Bumi Harjo Batanghari Lampung Timur)
  • Sep 12, 2025
  • Jurnal Manajemen DIVERSIFIKASI
  • Damia Latifa + 2 more

This research is motivated by the problem of the high level of competitiveness of tilapia hatchery businesses that sell similar seeds, the lack of business capital to develop businesses such as adding ponds and reaching customers outside the city, the lack of optimal managerial ability in managing the business, the low level of skill of tilapia hatchery business actors in utilizing technology and workforce capabilities that are not yet professional because they have not gone through clear selection only emphasize brotherhood. This research aims to find out whether entrepreneurial orientation and business strategy individually and together have an effect on business performance. The sample for this research was tilapia hatchery business actors in Bumi Harjo Village, Batanghari, East Lampung, totaling 50 respondents. The research method uses a quantitative type of research. Data collection used questionnaires, and data analysis used validity tests, reliability tests, normality tests, linearity tests, homogeneity tests, multiple linear regression analysis, t tests, f tests, coefficient of determination r2 tests and statistical hypotheses using SPSS version 25 program tools. The results of this research show that (1) entrepreneurial orientation has a positive and significant effect on business performance with a tcount of 25.944 > ttable 2.011 and sig 0.000 > 0.05. (2) business strategy has a positive and significant effect on business performance with a tcount of 32.717 > ttable 2.011 and sig 0.000 > 0.05. (3) entrepreneurial orientation has a positive and significant effect on business performance with a value of fcount > ftable 3.20 and sig 0.000 > 0.05.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 5
  • 10.24127/jpf.v7i1.1923
PENGARUH KETRAMPILAN PROSES SAINS TERINTEGRASI KARAKTER TERHADAP HASIL BELAJAR FISIKA
  • Mar 29, 2019
  • Jurnal Pendidikan Fisika
  • Imas Ratna Ermawati + 2 more

This research was conducted to find out whether the test of science process skill integrated character to influence on high school physics learning result. The research was conducted at SMAN 1 Maumere in class XI MIA 1 as experiment class and XI MIA 2 as control class with material that is optical equipments in the even semester of academic year 2017/2018. The research method used is experiment with Quasi Experimental Quasi Experimental Experimental design that used is The Noneequivalent Pretest-Posttest Control Group Design ,, while the research subject taking technique is Simple Random Sampling. Research subjects used amounted to 33 students both from the experimental class and control class. Before the research instrument is used first tested the validity and reliability test. In the validity test using Product Moment Pearson correlation, there are 24 tested questions and obtained the result of 18 valid questions and 6 drop problems. In reliability test by using Cronbach Alpha formula in get result rhitung = 0,870> 0,329; so the instrument is said to reliab with high interpretation. The next stage conducted data retrieval using instruments that have been valid and reliable then tested the prerequisite analysis in the form of normality test and homogeneity test. In the normality test using chi square obtained results; with a significant level of α = 0.05 and a sample size of 33 students in the experimental class and in the control class which means that the experimental and control classes are normally distributed. On the homogeneity test using F test obtained the results,; with a significant level α = 0.05 and the number of samples of 33 students then the class of research is a homogeneous class. In the last stage, hypothesis testing with t-test and obtained the result,; with significant level α = 0.05 and the number of samples as many as 33 students, the skills of integrated science processes developed characters that influence the results of physics learning.

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant