Feature Importance Mapping in performative predictions

Publication date

DOI

Document Type

Master Thesis

Collections

Open Access logo

License

CC-BY-NC-ND

Abstract

Machine learning models have become widely used in recent years. As models are deployed and interact with real-world environments, they are susceptible to performance deterioration due to various factors such as data drift and model- induced changes in the surrounding environment. Addressing these challenges requires innovative approaches to maintain model effectiveness over time. This research focuses on devising strategies to mitigate the effects of perfor- mative drift by exploring feature transformation techniques and robust classi- fier training methods. Drawing inspiration from transfer learning concepts, the study aims to find feature representations resilient to drift or capable of reversing its effects. Additionally, it investigates the feasibility of training drift-resistant classifiers in transformed feature spaces. The research questions investigate the availability of performative data genera- tors, methods for computing feature transformations, and the impact of these transformations on data distributions. Furthermore, the study examines the possibility of training robust classifiers independent of the strength of perfor- mative effects and explores potential modifications to improve the effectiveness of the proposed methods. The main innovation introduced in this paper is the design of an architecture capable of providing drift-resistant classification and mapping of points back to the starting distribution. The devised model is a synthesis of a domain adversarial neural network with a generative adversarial neural network. The main experimental method used by this thesis is simulation, combining performative data generators available in the literature, existing transfer learn- ing and newly created architecture. Finally, a series of experiments has been performed and it has been proven that under certain conditions it is possible to train a stable classifier. Alongside that classifier, a generator network is trained. That network with some approximation can reproduce the original form of the dataset, which has been influenced by performative drift.

Keywords

performative drift; alternative to retraining; generative domain adversarial neural network; mapping function; data distribution;

Citation