The restoration of the probability distribution of observed data by artificial neural networks is the most important part of machine learning. The probability distribution not only allows us to predict the behaviour of the system under study, but also to quantify the uncertainty with which forecasts are made. The main difficulty is that, as a rule, only the data are observed, but their exact probability distributions are not available. To solve this problem, Bayesian and other similar approximate methods are used. But their use increases the complexity of a neural network and therefore makes its training more complicated.
RUDN University and the Free University of Berlin mathematicians used deterministic weights in neural networks, which would help overcome the limitations of Bayesian methods. They developed a formula that allows one to correctly estimate the variance of the distribution of observed data. The proposed model was tested on different data: synthetic and real; on data containing outliers and on data from which the outliers were removed. The new method allows restoration of probability distributions with accuracy previously unachievable.
The mathematicians of RUDN University and the Free University of Berlin used deterministic weights for neural networks and used the networks outputs to encode the distribution of latent variables for the desired marginal distribution. An analysis of the training dynamics of such networks allowed them to obtain a formula that correctly estimates the variance of observed data, despite the presence of outliers in the data. The proposed model was tested on different data: synthetic and real. The new method allows restoring probability distributions with higher accuracy compared with other modern methods. Accuracy was assessed using the AUC method (area under the curve is the area under the graph that allows making assessment of the mean square error of the predictions depending on the sample size estimated by the network as “reliable”; the higher the AUC score, the better the predictions).
The article was published in the journal Artificial Intelligence.
Yakov Kuzyakov, a well-known soil scientist, a leading scientist at RUDN Agricultural and Technological Institute and a professor at the King Saud University, has been awarded the title of highly cited researcher in the field of agricultural sciences (according to Clarivate).
A RUDN chemist has obtained a new compound — a dumbbell-shaped phosphate-bridged molybdenum cluster. The cluster accelerates the reaction of the formation of sulfides from oxides and can be used in pharmaceutical and cosmetic manufacturing.
Mathematicians from RUDN University have studied the properties of compositional operators in spaces with mixed Lebesgue norms. It will help describe the diffusion of liquids in materials with cracks and in porous materials. Such spaces are also useful for obtaining estimates for solutions to the Navier-Stokes equation.