👀Up next 
Building upon these findings, we've managed to externalize this internal mechanism, creating a general-purpose mention detector with promising results. Stay tuned! 🔜
               
            
            
                22.10.2025 08:16 — 👍 0    🔁 0    💬 0    📌 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            I'll be presenting this work at @blackboxnlp.bsky.social in Suzhou, happy to chat there or here if you are interested !
               
            
            
                22.10.2025 08:16 — 👍 0    🔁 0    💬 1    📌 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            3️⃣  The Entity Lens
Our method enables reconstruction of entity mentions from any representation within LLMs, allowing to ask: “What entity is the model thinking about right now?”  
💡 When reading ‘the City of Lights iconic monument’, the model internally “thinks” of Paris and the Eiffel Tower !
               
            
            
                22.10.2025 08:16 — 👍 0    🔁 0    💬 1    📌 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            2️⃣ LLMs develop entity-specific mechanisms. 
By sucessfully learning "Tasks Vectors" steering the model to reconstruct the mention, we uncover new evidence that LLMs form dedicated internal circuits to represent and manipulate multi-token entities.
               
            
            
                22.10.2025 08:16 — 👍 0    🔁 0    💬 1    📌 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            1️⃣ Common entities are (almost) part of the Vocabulary. 
We prove that common multi-token mentions (e.g. "Eiffel Tower") can be recovered from the middle-layer hidden state of its last token only !
Uncommon mentions aren't fully encoded this way; but rather retrieved from the context when needed.
               
            
            
                22.10.2025 08:16 — 👍 0    🔁 0    💬 1    📌 0                      
            
         
            
        
            
            
            
            
                                                 
                                                
    
    
    
    
            New paper at @blackboxnlp.bsky.social @ @emnlpmeeting.bsky.social ! 
⚛️ Entities are the fundamental building blocks of knowledge. Although some clues emerge from mechanistic interpretability, how auto-regressive LLMs actually encode and retrieve them remains a mystery. 🧵
📄 arxiv.org/abs/2510.09421
               
            
            
                22.10.2025 08:16 — 👍 1    🔁 1    💬 1    📌 0                      
            
         
    
         
        
            
        
                            
                    
                    
                                            Natural Language Processing and Computational Linguistics group at the University of Groningen 🐮
https://www.rug.nl/research/clcg/research/cl/
                                     
                            
                    
                    
                                            Postdoc @mlia_isir@sciences.re (Sorbonne Université, CNRS, ISIR)
 / Teacher @ aivancity
 / Teacher Assistant @ ENSAE
https://paullerner.github.io/
                                     
                            
                    
                    
                                            PhD student at Sorbonne University
                                     
                            
                    
                    
                                            PhD Student at @gronlp.bsky.social 🐮, core dev @inseq.org. Interpretability ∩ HCI ∩ #NLProc.
gsarti.com
                                     
                            
                    
                    
                                            http://cljournal.org
Computational Linguistics, established in 1974, is the official flagship journal of the Association for Computational Linguistics (ACL).
                                     
                            
                    
                    
                                            EMNLP 2025 - The annual Conference on Empirical Methods in Natural Language Processing
Dates: November 5-9, 2025 in Suzhou, China
Hashtags: #EMNLP2025 #NLP
Submission Deadline: May 19th, 2025
                                     
                            
                    
                    
                                            The largest workshop on analysing and interpreting neural networks for NLP. 
BlackboxNLP will be held at EMNLP 2025 in Suzhou, China
blackboxnlp.github.io
                                     
                            
                    
                    
                                            Directeur de recherche at Inria, former invited professor at Collège de France, co-founder of opensquare
                                     
                            
                    
                    
                                            Looking to start a post-doc in early 2025!
Working on the representations of LMs and pretraining methods 
 @InriaParis
https://nathangodey.github.io
                                     
                            
                    
                    
                                            ALMAnaCH, the Inria Paris NLP research team. 
                                     
                            
                    
                    
                                            The Conference of the European Chapter of the Association for Computational Linguistics
Next event: Rabat, Morocco, March 24-29, 2026
Hashtags: #EACL2026 #NLProc
                                     
                            
                    
                    
                                            Our in depth reporting on innovation reveals and explains what’s happening now to help you know what’s coming next. 
Find our journalists on Bluesky: https://bsky.app/starter-pack/technologyreview.com/3lar7fofuwl2n
                                     
                            
                    
                    
                                            Research Engineer at Sorbonne Université (MLIA team)
Computer Vision, Medical Imaging
                                     
                            
                    
                    
                                            PhD student in visual representation learning at Valeo.ai and Sorbonne Université (MLIA)
                                     
                            
                    
                    
                                            MLIA research team at CNRS/ISIR lab in Sorbonne University @sorbonne-universite.fr 
https://www.isir.upmc.fr/equipes/mlia/
                                     
                            
                    
                    
                                            CEO of Calicarpa. President of Tournesol🌻. ML security researcher. Science4All.
@Polytechnique X07, @polymtl PhD, ex @mit @epfl. 
Writer, @Orange AI ethics.
                                     
                            
                    
                    
                                            https://mega002.github.io
                                     
                            
                    
                    
                                            Fact-checking & investigation numérique.
Compte officiel. 
Un contenu à signaler ? ➡️ WhatsApp 📱 06 47 08 70 46
https://factuel.afp.com/
                                     
                            
                    
                    
                                            The Association for Computational Linguistics (ACL) is a scientific and professional organization for people working on Natural Language Processing/Computational Linguistics.
Hash tags: #NLProc #ACL2025NLP
                                     
                            
                    
                    
                                            International Conference on Learning Representations  https://iclr.cc/