3
Mathematicians from RUDN University and the Free University of Berlin proposed a new way of using neural networks for working with noisy high-dimensional data

Mathematicians from RUDN University and the Free University of Berlin proposed a new way of using neural networks for working with noisy high-dimensional data

Mathematicians from RUDN University and the Free University of Berlin have proposed a new approach to studying the probability distributions of observed data using artificial neural networks. The new approach works better with so-called outliers, i.e. input data objects that deviate significantly from the overall sample.

The restoration of the probability distribution of observed data by artificial neural networks is the most important part of machine learning. The probability distribution not only allows us to predict the behaviour of the system under study, but also to quantify the uncertainty with which forecasts are made. The main difficulty is that, as a rule, only the data are observed, but their exact probability distributions are not available. To solve this problem, Bayesian and other similar approximate methods are used. But their use increases the complexity of a neural network and therefore makes its training more complicated.

RUDN University and the Free University of Berlin mathematicians used deterministic weights in neural networks, which would help overcome the limitations of Bayesian methods. They developed a formula that allows one to correctly estimate the variance of the distribution of observed data. The proposed model was tested on different data: synthetic and real; on data containing outliers and on data from which the outliers were removed. The new method allows restoration of probability distributions with accuracy previously unachievable.

The mathematicians of RUDN University and the Free University of Berlin used deterministic weights for neural networks and used the networks outputs to encode the distribution of latent variables for the desired marginal distribution. An analysis of the training dynamics of such networks allowed them to obtain a formula that correctly estimates the variance of observed data, despite the presence of outliers in the data. The proposed model was tested on different data: synthetic and real. The new method allows restoring probability distributions with higher accuracy compared with other modern methods. Accuracy was assessed using the AUC method (area under the curve is the area under the graph that allows making assessment of the mean square error of the predictions depending on the sample size estimated by the network as “reliable”; the higher the AUC score, the better the predictions).

The article was published in the journal Artificial Intelligence.

International Projects View all
International scientific cooperation View all
12 Dec 2024
From 19 to 23 November 2024, RUDN hosted the III International Scientific Conference ‘For the Sustainable Development of Civilisation: Cooperation, Science, Education, Technology’. The event gathered more than 2000 participants from 72 countries.
631
Similar newsletter View all
08 Aug
Focusing on science as a way of life, sustainable development goals as a scientist's mission and new technological developments: RUDN honored leaders in science and innovation

The RUDN University Science and Innovation Prize winners were honoured at the extended meeting of the Academic Council. In 2024 the terms of the traditional RUDN University Prize were changed: for the first time the competition was announced in two categories: leading scientists and young scientists.

284
08 Aug
RUDN University scientist: Africa relies on small modular reactors to solve energy problems

According to the International Energy Agency (IEA), electricity consumption in Africa has increased by more than 100% over the past two years (2020-2022). However, 74.9% of this energy is still produced by burning organic fuels — natural gas, coal and oil. At the same time, the level of electrification on the continent remains extremely low — only 24%, while in other developing countries it reaches 40%. Even in grid-connected areas, electricity supply is often unreliable: industrial enterprises lose energy on an average of 56 days a year.

268
08 Aug
RUDN dentists developed a program that will accelerate the work of an orthodontist by 40%

Today, diagnosis and treatment planning with orthodontists takes several days. Also, complications can arise during treatment that slow down the patient's recovery process. For example, improper orthodontic treatment planning can lead to temporomandibular joint dysfunction.

140
Similar newsletter View all