

Estimation of traditional numerical characteristics of multimodal distribution laws of a one-dimensional random variable in conditions of large volume statistical data
https://doi.org/10.32446/0368-1025it.2025-2-47-54
Abstract
The efficiency of estimating the traditional numerical characteristics of multimodal symmetric and asymmetric distribution laws of a one-dimensional random variable with large amounts of statistical data is considered. To circumvent the problem of large samples, the formulas for discretization of the interval of values of a random variable by Sturgess, Brooks-Carruthers, Heinhold-Gaede and the formula for optimal discretization proposed by the authors of this article were used. For this purpose, data arrays have been formed that allow us to evaluate the numerical characteristics of the laws of distribution of random variables, taking into account their discrete values. Estimates of mathematical expectation, mean square deviation, coefficients of asymmetry and kurtosis are calculated from the transformed data sets. Estimates of the numerical characteristics of the considered distribution laws for continuous and discrete random variables with different volumes of initial statistical data are compared. The efficiency of methods for estimating the numerical characteristics of multimodal distribution laws based on initial statistical information and the results of transforming this information using the specifi ed discretization formulas has been established. The reliability of comparing the performance indicators of the studied methods was confirmed using the Kolmogorov-Smirnov criterion. It is shown that the Heinhold-Gaede formula and the optimal discretization formula proposed by the authors are more effective than the Sturgess and Brooks-Carruthers discretization formulas. The obtained results can be used in processing remote sensing data of natural objects, which are characterized by a large volume of statistical information and multimodal laws of distribution of spectral features.
Keywords
About the Authors
A. V. LapkoRussian Federation
Aleksandr V. Lapko
Krasnoyarsk
V. A. Lapko
Russian Federation
Vasiliy A. Lapko
Krasnoyarsk
References
1. Lapko A. V., Lapko V. A. Estimation of traditional numerical characteristics of lognormal distribution laws of a onedimensional random variable in conditions of large volume statistical data. Measurement Techniques, 67(2), 109–118 (2024). https://doi.org/10.1007/s11018-024-02332-y
2. Shipko V. V., Borzov S. M. Analysis of the efficiency of hyperspectral data classification under constraints on the quantization bit depth, the number of spectral channels, and spatial resolution. Optoelectronics, Instrumentation and Data Processing, 58(3), 273–280 (2022). https://doi.org/10.3103/s8756699022030062
3. Borzov S. M., Nezhevenko E. S. Neural network technologies for detection and classification of objects. Optoelectronics, Instrumentation and Data Processing, 59(3), 329–345 (2023). https://doi.org/10.3103/s8756699023030032
4. Lebedev I. S. Adaptive application of machine learning models on separate segments of a data sample in regression and classification problems. Information and Control Systems, (3), 20–30 (2022). (In Russ.) https://doi.org/10.31799/1684-8853-2022-3-20-30
5. Kivchun O. R. Data validation algorithm based on vector rank analysis. Information technologies, 30(4), 198–205 (2024). (In Russ.) https://doi.org/10.17587/it.30.198-205
6. Sharueva A. V., Lapko A. V., Lapko V. A., Nonparametric methods for hypotheses testing about distributions of random variables in the analysis of remote sensing data. SB RAS, Novosibirsk (2024). (In Russ.) https://doi.org/10.53954/9785604990094
7. Lapko A. V., Lapko V. A. Comparison of the effectiveness of methods for sampling the range of variation of random quantities in synthesis of nonparametric estimates of probability density. Measurement Techniques, 57(3), 222–227 (2014). https://doi.org/10.1007/s11018-014-0435-y
8. Sturgess H. A., Journal of the American Statistical Association, 21, 65–66 (1926). https://doi.org/10.1080/01621459.1926.10502161
9. Storm R. Wahrscheinlichkeitsrechnung, mathematische statistik und statistische qualitätskontrolle. Fachbuchverlag, Leipzig, (2001). (In German)
10. Heinhold J., Gaede K.-W. Ingenieur-Statistik. R. Oldenbourg Verlag, München-Wien (1972). (In German) https://doi.org/10.1002/cite.330450621
11. Lapko A. V., Lapko V. A. Integral estimate from the square of the probability density for a one-dimensional random variable. Measurement Techniques, 63(7), 534–542 (2020). https://doi.org/10.1007/s11018-020-01820-1
12. Robertson C. A., Fryer J. G. Some descriptive properties of normal mixtures. Scandinavian Actuarial Journal, 1969(3-4), 137–146 (1969). https://doi.org/10.1080/03461238.1969.10404590
13. Eisenberger I. Genesis of bimodal distributions. Technometrics, 6(4), 357–363 (1964). https://doi.org/10.1080/00401706.1964.10490199
14. Ray S., Lindsay B. G. The topography of multivariate normal mixtures. Annals of Statistics, 33(5), 2042–2065 (2005). https://doi.org/10.1214/009053605000000417
Supplementary files
Review
For citations:
Lapko A.V., Lapko V.A. Estimation of traditional numerical characteristics of multimodal distribution laws of a one-dimensional random variable in conditions of large volume statistical data. Izmeritel`naya Tekhnika. 2025;74(2):47-54. (In Russ.) https://doi.org/10.32446/0368-1025it.2025-2-47-54