Preview

Izmeritel`naya Tekhnika

Advanced search
Open Access Open Access  Restricted Access Subscription Access

Estimation of traditional numerical characteristics of multimodal distribution laws of a one-dimensional random variable in conditions of large volume statistical data

https://doi.org/10.32446/0368-1025it.2025-2-47-54

Abstract

The efficiency of estimating the traditional numerical characteristics of multimodal symmetric and asymmetric distribution laws of a one-dimensional random variable with large amounts of statistical data is considered. To circumvent the problem of large samples, the formulas for discretization of the interval of values of a random variable by Sturgess, Brooks-Carruthers, Heinhold-Gaede and the formula for optimal discretization proposed by the authors of this article were used. For this purpose, data arrays have been formed that allow us to evaluate the numerical characteristics of the laws of distribution of random variables, taking into account their discrete values. Estimates of mathematical expectation, mean square deviation, coefficients of asymmetry and kurtosis are calculated from the transformed data sets. Estimates of the numerical characteristics of the considered distribution laws for continuous and discrete random variables with different volumes of initial statistical data are compared. The efficiency of methods for estimating the numerical characteristics of multimodal distribution laws based on initial statistical information and the results of transforming this information using the specifi ed discretization formulas has been established. The reliability of comparing the performance indicators of the studied methods was confirmed using the Kolmogorov-Smirnov criterion. It is shown that the Heinhold-Gaede formula and the optimal discretization formula proposed by the authors are more effective than the Sturgess and Brooks-Carruthers discretization formulas. The obtained results can be used in processing remote sensing data of natural objects, which are characterized by a large volume of statistical information and multimodal laws of distribution of spectral features.

About the Authors

A. V. Lapko
Institute of Computational Modelling of the Siberian Branch of the Russian Academy of Sciences; Reshetnev Siberian State University of Science and Technology
Russian Federation

Aleksandr V. Lapko

Krasnoyarsk



V. A. Lapko
Institute of Computational Modelling of the Siberian Branch of the Russian Academy of Sciences; Reshetnev Siberian State University of Science and Technology
Russian Federation

Vasiliy A. Lapko

Krasnoyarsk



References

1. Lapko A. V., Lapko V. A. Estimation of traditional numerical characteristics of lognormal distribution laws of a onedimensional random variable in conditions of large volume statistical data. Measurement Techniques, 67(2), 109–118 (2024). https://doi.org/10.1007/s11018-024-02332-y

2. Shipko V. V., Borzov S. M. Analysis of the efficiency of hyperspectral data classification under constraints on the quantization bit depth, the number of spectral channels, and spatial resolution. Optoelectronics, Instrumentation and Data Processing, 58(3), 273–280 (2022). https://doi.org/10.3103/s8756699022030062

3. Borzov S. M., Nezhevenko E. S. Neural network technologies for detection and classification of objects. Optoelectronics, Instrumentation and Data Processing, 59(3), 329–345 (2023). https://doi.org/10.3103/s8756699023030032

4. Lebedev I. S. Adaptive application of machine learning models on separate segments of a data sample in regression and classification problems. Information and Control Systems, (3), 20–30 (2022). (In Russ.) https://doi.org/10.31799/1684-8853-2022-3-20-30

5. Kivchun O. R. Data validation algorithm based on vector rank analysis. Information technologies, 30(4), 198–205 (2024). (In Russ.) https://doi.org/10.17587/it.30.198-205

6. Sharueva A. V., Lapko A. V., Lapko V. A., Nonparametric methods for hypotheses testing about distributions of random variables in the analysis of remote sensing data. SB RAS, Novosibirsk (2024). (In Russ.) https://doi.org/10.53954/9785604990094

7. Lapko A. V., Lapko V. A. Comparison of the effectiveness of methods for sampling the range of variation of random quantities in synthesis of nonparametric estimates of probability density. Measurement Techniques, 57(3), 222–227 (2014). https://doi.org/10.1007/s11018-014-0435-y

8. Sturgess H. A., Journal of the American Statistical Association, 21, 65–66 (1926). https://doi.org/10.1080/01621459.1926.10502161

9. Storm R. Wahrscheinlichkeitsrechnung, mathematische statistik und statistische qualitätskontrolle. Fachbuchverlag, Leipzig, (2001). (In German)

10. Heinhold J., Gaede K.-W. Ingenieur-Statistik. R. Oldenbourg Verlag, München-Wien (1972). (In German) https://doi.org/10.1002/cite.330450621

11. Lapko A. V., Lapko V. A. Integral estimate from the square of the probability density for a one-dimensional random variable. Measurement Techniques, 63(7), 534–542 (2020). https://doi.org/10.1007/s11018-020-01820-1

12. Robertson C. A., Fryer J. G. Some descriptive properties of normal mixtures. Scandinavian Actuarial Journal, 1969(3-4), 137–146 (1969). https://doi.org/10.1080/03461238.1969.10404590

13. Eisenberger I. Genesis of bimodal distributions. Technometrics, 6(4), 357–363 (1964). https://doi.org/10.1080/00401706.1964.10490199

14. Ray S., Lindsay B. G. The topography of multivariate normal mixtures. Annals of Statistics, 33(5), 2042–2065 (2005). https://doi.org/10.1214/009053605000000417


Supplementary files

Review

For citations:


Lapko A.V., Lapko V.A. Estimation of traditional numerical characteristics of multimodal distribution laws of a one-dimensional random variable in conditions of large volume statistical data. Izmeritel`naya Tekhnika. 2025;74(2):47-54. (In Russ.) https://doi.org/10.32446/0368-1025it.2025-2-47-54

Views: 70


ISSN 0368-1025 (Print)
ISSN 2949-5237 (Online)