Ultimately, our results reveal a key limitation of current crop-based representation learning models for single-cell analysis, and underscore the need for methods that learn background-invariant single-cell features.
               
            
            
                04.07.2025 21:50 β π 0    π 0    π¬ 1    π 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            The background sensitivity of models has consequences for reliable single-cell analysis.
When background cells are masked, predicted localization distributions for multi-localized proteins, or proteins where localization changes from cell to cell, differ substantially for ~15% of proteins.
               
            
            
                04.07.2025 21:50 β π 0    π 0    π¬ 1    π 0                      
            
         
            
        
            
            
            
            
                                                
                                                
    
    
    
    
            We find that three leading single-cell modelsβPIFiA, Paired Cell Inpainting, and DeepLocβare sensitive to background context.
Classification accuracy drops by up to 15.8% when background cell localization differs from the center cell, and improves when they match.
               
            
            
                04.07.2025 21:50 β π 0    π 0    π¬ 1    π 0                      
            
         
            
        
            
            
            
            
                                                
                                                
    
    
    
    
            To assess if background alters model behavior, we evaluate five settings where the center cell is fixed while the background is either unaltered, swapped with backgrounds from the same or different protein localization classes, replaced with a cell-free background, or masked out.
               
            
            
                04.07.2025 21:50 β π 0    π 0    π¬ 1    π 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            We focus on representation learning models trained to capture protein localization, the subcellular component within the cell where a protein resides, in yeast cells.
               
            
            
                04.07.2025 21:50 β π 0    π 0    π¬ 1    π 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            This matters because understanding single-cell variability is key in biology. If models are altering their behavior depending on background, they are not robust for biological analyses. With @alexijie.bsky.social and Alan Moses, we introduce a framework to systematically test for this.
               
            
            
                04.07.2025 21:50 β π 0    π 0    π¬ 1    π 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            Deep learning methods are widely used to analyze microscopy images. But many operate on crops of images that, while centered around a single cell, may include other cells in the background, raising the question:
Are these models truly capturing single-cell features?
               
            
            
                04.07.2025 21:50 β π 1    π 0    π¬ 1    π 0