ENACT Apprentissage et suivi des distributions de données sous forme de graphes neuronaux pour l'apprentissage continu embarqué

Offre de thèse

Date limite de candidature

25-04-2025

Date de début de contrat

01-09-2025

Directeur de thèse

GIRAU Bernard

Encadrement

Comité de suivi individuel de thèse tel que défini par l'ED IAEM

Type de contrat

ANR Financement d'Agences de financement de la recherche

Candidater à cette offre

école doctorale

IAEM - INFORMATIQUE - AUTOMATIQUE - ELECTRONIQUE - ELECTROTECHNIQUE - MATHEMATIQUES

équipe

BISCUIT

contexte

This PhD offer is provided by the ENACT AI Cluster and its partners. Find all ENACT PhD offers and actions on https://cluster-ia-enact.ai/. This PhD will be supervised by Bernard Girau, head of the Biscuit team of the LORIA laboratory. LORIA is the French acronym for the “Lorraine Research Laboratory in Computer Science and its Applications” and is a research unit (UMR 7503), common to CNRS, the University of Lorraine, CentraleSupélec and Inria. LORIA's missions mainly deal with fundamental and applied research in computer sciences. With 500 people working in the lab, LORIA is today one of the biggest research labs in the Grand Est Region. The long-term goal of the Biscuit team is to propose bio-inspired computing architectures that will learn to control persistent autonomous systems interacting with dynamical, complex and unforeseeable environments. These architectures are mostly inspired by neural mechanisms of self-organization and computation emergence from neural populations driven by sensorimotor feedback. The adaptation of these architectures for hardware implementations (on FPGAs or neuromorphic chips) is also a major topic of the team.

spécialité

Informatique

laboratoire

LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications

Mots clés

réseaux de neurones, quantification vectorielle topographique, apprentissage continu, gaz neuronaux, distributions non stationnaires, compatibilité matérielle

Détail de l'offre

Ce sujet s'inscrit dans le cadre du projet ANR SORLAHNA, dont les objectifs sont : 1) concevoir une méthodologie d'implémentation matérielle flexible pour des architectures neuronales polyvalentes de quantification vectorielle topographique (QV), basée sur un NoC (réseau sur puce) capable d'instancier une topologie virtuelle dynamique sur un circuit programmable ; et 2) définir des modèles de QV topographique pour l'acquisition dynamique de données enregistrées à partir d'un réseau distribué de capteurs et destinées à l'apprentissage automatique. Cette thèse se concentre sur ce deuxième aspect.

L'étudiant analysera d'abord les différents algorithmes neuronaux de QV topographique existants (SOM, NG), ainsi que leurs différentes variantes proposées pour les données non stationnaires (DSOM, CSOM [4], NP-SOM [5], PSOM [6], GNG, GNG-T [7], GWR, etc.), dans le contexte de l'apprentissage continu (CL). La plupart des travaux actuels sur la CL considèrent un problème dont les statistiques sont stationnaires, mais pour lequel ces échantillons sont fournis de manière non i.i.d., comme par exemple quand on fournit d'abord des échantillons de certaines classes seulement, puis des échantillons d'autres classes dans un second temps, alors que l'objectif est de traiter un problème de classification sur toutes ces classes. Nous avons étendu le CL aux cas où les statistiques sont non stationnaires, ce qui conduit à considérer des problèmes où les données d'apprentissage sont fournies en ligne et extraites : 1) avec un processus d'échantillonnage non i.i.d., 2) à partir de distributions qui évoluent lentement. Nous avons proposé un formalisme pour ce CL étendu, définissant différents contextes d'apprentissage qui dérivent dans le temps. Nous étudions comment les modèles multi-VQ (chaque modèle apprend un contexte et s'adapte à ses changements) peuvent aborder des problèmes aussi complexes. Un aspect clé est de pouvoir identifier le contexte actuel à partir des données fournies en ligne. L'étudiant analysera les liens entre la distribution des données et la forme du graphe appris par les modèles de QV topographique, afin de définir des algorithmes capables de fournir une bonne estimation du contexte actuel ou de déterminer un nouveau contexte. Pour faire le lien avec l'objectif matériel du projet Sorlahna, cette étude devra prendre en compte la tolérance des algorithmes de QV topographique aux contraintes matérielles de codage arithmétique simplifié et de calcul à précision limitée. Si les SOM se sont avérées robustes à différents types de perturbations de calcul ([8]), cette robustesse devra être évaluée et étendue à la préservation de la forme du graphe appris pour les modèles de QV topographique que nous utilisons avec des données non stationnaires : le compromis fin entre stabilité et dynamicité requis pour le CL risque d'entrer en conflit avec la perte de capacité de réglage très fin inhérente à la réduction de la précision des calculs. L'étudiant étudiera ensuite les méthodes et propriétés permettant de (re)générer des données uniquement à partir des informations codées dans le graphe appris par QV topographique, ces données devant être statistiquement représentatives du problème ou du contexte. Cette question dépend directement de la capacité des modèles de QV à bien modéliser la densité de probabilité de l'espace d'entrée dans le cadre du CL étendu. Une fois la capacité de (re)génération de nos algorithmes de QV topographique validée, des protocoles de transmission dynamique des informations de QV seront définis afin d'optimiser l'acquisition de données à partir de capteurs physiquement distribués, lorsque l'information recherchée n'est pas la connaissance des valeurs exactes perçues, mais la connaissance de leur distribution, comme cela peut être le cas lors de tâches d'apprentissage automatique.

Keywords

neural networks, topographic vector quantization, continual learning, neural gases, non-stationary distributions, hardware compliance

Subject details

This subject is part of the ANR SORLAHNA project, whose two objectives are: 1) to design a sufficiently flexible hardware implementation methodology for versatile neural architectures of topographic vector quantization (VQ), based on a NoC (network on chip ) able to instantiate a dynamic virtual topology on a programmable circuit, and 2) to define topographic VQ models for the dynamic acquisition of data recorded from a distributed network of sensors and intended for machine learning. This thesis focuses on this second aspect. In a first phase, the student will analyze the different existing topographic VQ neural algorithms (SOM, NG), as well as their different variants proposed for non-stationary data (DSOM, CSOM [4], NP-SOM [5], PSOM [6], GNG, GNG-T [7], GWR, etc.), in the context of continual learning (CL). Most current work on CL considers a problem whose statistics are stationary, from which samples could be extracted in an i.i.d manner, but for which these samples are provided in a non-i.i.d manner, for practical or technical reasons,. The typical example is that of a classification where we first provide samples from certain classes only, then samples from other classes in a second step, whereas the goal is to deal with a classification problem on all these classes. We have extended the CL problem to cases where statistics are intrinsically non-stationary, which leads to consider problems where learning data are provided on-line and extracted: 1) with a non-i.i.d. sampling process, 2) from distributions that slowly change. We have proposed a formalism for this extended CL, defining different contexts of learning that drift with time. We study how multi-VQ models (each model learns a context and adapts to its changes) may tackle such complex problems. A key aspect is to be able to identify the current context from the data provided on-line. The student will analyze the close links that exist between the data distribution and the shape of the graph that topographic VQ algorithms learn, so as to define anytime algorithms able to provide a good estimate of the current context, or to assert that a new context has been found. In order to take into account the first objective of the Sorlahna project (hardware), this study must consider the tolerance of the topographic VQ algorithms to the most classic constraints of hardware implementations: simplified arithmetic coding and limited precision calculation. If SOMs have proven robust to different types of calculation perturbations ([8]), this robustness will have to be evaluated and extended to graph shape preservation for the topographic VQ models we use with non-stationary data: the fairly fine compromise between stability and dynamicity required for CL risks colliding with the loss of very fine tuning capability that is inherent in reducing the precision of the calculations. In a second phase, the student will study the methods and properties that make it possible to (re)generate data only from the information encoded in the graph learned by topographic VQ, such data having to be statistically representative of the problem or context. This question directly depends on the ability of VQ models to “well” model the probability density of the input space, but it becomes more complex in a context of CL and dynamic evolution of the data distribution. Validation of the (re)generative capacity of our topographic VQ algorithms will pave the way for the third phase of the work. In this phase, protocols for the dynamic transmission of VQ information will be defined in order to optimize data acquisition from physically distributed sensors, when the information that is sought is not the knowledge of the exact values perceived, but the knowledge of their distribution, as it can be the case during machine learning tasks.

Profil du candidat

Le/La candidat(e) devra être titulaire d'un diplôme équivalent à un master en informatique, de préférence dans une spécialité liée à l'intelligence artificielle et/ou au calcul numérique distribué. Une connaissance adéquate de la conception de matériel numérique sera appréciée, ainsi qu'une solide expérience en conception logicielle. Toute expérience dans le domaine de l'informatique neuromorphique sera un atout important. Le/La candidat(e) devra parler couramment l'anglais et/ou le français.

Candidate profile

The candidate should have the equivalent of a Master in Computer Science, preferably in a specialty related to artificial intelligence and/or distributed numerical computation. Adequate knowledge of digital hardware design will be assessed, as well as a solid experience in software design. Any work already done in the field of neuromorphic computing will be an important asset. The candidate must be fluent in English and/or French.

Référence biblio

[1] T. Kohonen. Self-organized formation of topologically correct feature maps in Biological Cybernetics, vol. 43(1), pp. 59–69, 1982.
[2] B. Fritzke. A growing neural gas network learns topologies In G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Advances in Neural Information Processing Systems 7, pages 625 632. MIT Press, Cambridge MA, 1995.
[3] L. Wang, X. Zhang, H. Su, and J. Zhu. A comprehensive survey of continual learning : Theory, method and application. arXiv, 2023.
[4] B. Girau and A. Upegui. Cellular Self-Organising Maps - CSOM. 13th Int. Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM), 2019.
[5] Y . Bernard et al. NP-SOM: network programmable self-organizing maps. IEEE 30th Int. Conf. on Tools with Artificial Intelligence (ICTAI), 2018.
[6] A. Upegui et al. Pruning Self-Organizing Maps for Cellular Hardware Architectures. 12th NASA/ESA Conference on Adaptive Hardware and Systems (AHS), 2018.
[7] H. Frezza-Buet. Online computing of non-stationary distributions velocity fields by an accuracy controlled growing neural gas. Neural Networks, 2014, 60, pp.203-221.
[8] B. Girau and C. Torres-Huitzil. Fault tolerance of Self-Organizing Maps. Neural Computing and Applications. Doi : 10.1007/s00521-018-3769-6, 2018.