Talk submission video for Posit Conf 2026 #RStats
ยซOrganizing your data science team to be efficient, adaptive to new developments, and resilient to turnover rateยป
URLs:
1. lcolladotor.github.io/bioc_team_ds/
2. research.libd.org/rstatsclub/
3. github.com/LieberInstit...
#OpenSource #Bioconductor
20.02.2026 21:18 โ ๐ 10 ๐ 4 ๐ฌ 1 ๐ 0
RDM Weekly - Issue 031
A weekly roundup of Research Data Management resources.
Issue 31 of RDM Weekly is out! ๐ฌ
โก๏ธ OS resources in Comm Sciences @csdisseminate.bsky.social
โก๏ธ RDA 25th Plenary Meeting Programme @researchdataall.bsky.social
โก๏ธ Ask for R Help by Creating a Small Reproducible Example @libdrstats.bsky.social
and more!
rdmweekly.substack.com/p/rdm-weekly...
03.02.2026 14:05 โ ๐ 13 ๐ 6 ๐ฌ 0 ๐ 0
@lieberinstitute.bsky.social @jhubiostat.bsky.social
lcolladotor.github.io/rnaseq_LCG-U... is looking ready for next week!
We'll have the most guest presenters ever this year with @meli-mq99.bsky.social @gabrielramirezv.bsky.social @mariagtac.bsky.social & @daiannaglez.bsky.social ๐๐ฝ
See you soon at LCG-UNAM! ๐ฒ๐ฝ
#RStats @bioconductor.bsky.social
03.02.2026 05:21 โ ๐ 8 ๐ 3 ๐ฌ 0 ๐ 1
In a recent journal club, I covered the SOFA paper by @wkhuber.bsky.social and colleagues. It's especially cool how you can guide the factorization approach with known covariates.
Check out their preprint (doi.org/10.1101/2024...) and our discussion (youtu.be/WKs-g4lXzZs)!
08.01.2026 18:00 โ ๐ 7 ๐ 3 ๐ฌ 0 ๐ 0
A colleague Geo @geopertea.bsky.social found out a cool trick to get Positron working with full support at our computing cluster JHPCE (remote file explorer, Positron assistant seeing the remote R session). Game changer for my work at LIBD! Check out his demo video (youtu.be/CnmaJG8VPp8) #positron
27.01.2026 19:53 โ ๐ 7 ๐ 3 ๐ฌ 0 ๐ 0
noted that DuckDB won't perform worse than dplyr at any point โ
, and it's pretty easy to use ๐, so he plans to keep using duckplyr more often ๐.
He wrapped up by saying the benchmarks were interesting ๐, and despite the mixed results, he still expects to be using duckplyr a lot going forward! ๐ช๐
28.01.2026 02:59 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
9 hours โฐ๐ซ, and he was trying to optimize it.
Surprisingly, this particular use case didn't benefit much from DuckDB ๐ฆ, leading to his main takeaway: performance is highly dependent on what you're doing ๐ฏ. However, he
28.01.2026 02:59 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
@nick-eagles.bsky.social concluded that scratch space location matters significantly for performance ๐พโก. He explained his motivation: he had a real use case with hundreds of millions of rows ๐ that originally took
Full length video: youtu.be/ikumz_QLZiE
#RStats #dplyr #duckplyr #rpolars #data.table
28.01.2026 02:59 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
He noted that most benchmarks focus on speed โก, but emphasized that memory efficiency ๐ง is another important variable to consider - particularly highlighting that data.table tends to be more memory efficient than dplyr ๐พโจ.
#RStats #dplyr #duckplyr #rpolars #data.table
28.01.2026 02:50 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
@nick-eagles.bsky.social introduced his presentation on benchmarking #dplyr alternatives ๐๐ป. He planned to review public benchmarks of various data manipulation packages ๐ฆ as well as his own custom benchmark results ๐ฌ
Full length ๐ฅ: youtu.be/ikumz_QLZiE
#RStats #dplyr #duckplyr #rpolars #data.table
28.01.2026 02:50 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
actually be more rigorous ๐โ
, as he was trying to argue with the AI model ๐ค๐ฌ.
He wrapped up by saying "that's it" ๐ฌ, and then @nick-eagles.bsky.social enthusiastically responded that this is super useful ๐ and they're probably going to switch over to using Positron today! ๐๐
#RStats @posit.co
27.01.2026 21:22 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
@geopertea.bsky.social discussed how the AI provides semantic understanding of code ๐ง ๐ป, showing how objects are generated from different parts of the codebase ๐. He acknowledged that you can do this manually too ๐จโ๐ป, and sometimes manual inspection might ...
#RStats #PositronIDE @posit.co
27.01.2026 21:22 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
He pointed out - why not use AI to do that for you instead? ๐ค๐ก The AI quickly gave him the context of where it happens, functioning like a smart search tool โก๐ฏ!
#RStats
27.01.2026 21:12 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
He described another helpful scenario: tracing how a "log counts assay" object was generated ๐ฌ. Normally, you'd search for the filename ๐, then have to manually eyeball the context ๐ to see if that object was being read or written in a particular file ๐.
#RStats
27.01.2026 21:12 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
@geopertea.bsky.social showed his chat history ๐ and explained a useful use case: he asked the AI to look into a repository ๐ and tell him what methods they were using for calculations ๐งฎ. Basically, he had the AI read the code and summarize it for him ๐โจ.
#RStats #Positron #PositronAI
27.01.2026 21:12 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
Once everyone agrees on the plan โ
, the idea is to switch to agent mode ๐ so it has access to actually modify the code according to that plan ๐โจ. That's the intended workflow! ๐ฏ
#RStats #Positron
27.01.2026 21:04 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
He explained the workflow: when you start modifying files or code ๐ป, it should switch to a different agent - either a specific agent or the main agent โก. The planning agent is good for creating plans through discussion ๐ฌ between the LLM in the chat and the user ๐ฅ.
#RStats #Positron
27.01.2026 21:04 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
@geopertea.bsky.social explained how you can add your own custom agents ๐ค to #Positron & showed the interactive way to configure them ๐ ๏ธ. He demonstrated how the "plan" agent is defined, noting that it doesn't use any file mod tools ๐โ - it's specifically designed for planning purposes ๐
#RStats
27.01.2026 21:04 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
authentication code ๐ which he pasted into the authorization page ๐. He had to authorize the application to access his GitHub account โ
, going through two-factor authentication on his phone ๐ฑ. Once approved, he returned to Positron which confirmed the GitHub account was successfully connected! ๐โจ
27.01.2026 20:54 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
@geopertea.bsky.social demonstrated setting up #Positron IDE ๐ป by first detecting local interpreters and updating extensions ๐ง. He showed how to login to @github.com to activate GitHub Copilot connectivity ๐ค for the Positron assistant. During the new installation, the app provides an...
#RStats
27.01.2026 20:54 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
I explained why including context is crucial ๐! I pointed out that someone could just paste code like "GRanges from 1 to 1000 plus..." but then I would be confused ๐ asking "where does this function come from? What package is this from?" ๐ฆโ
@jennybryan.bsky.social #RStats #reprex
27.01.2026 20:16 โ ๐ 4 ๐ 2 ๐ฌ 1 ๐ 0
I demonstrated running a #reprex in a fresh #RStats session ๐ป & opened it in my browser ๐. It showed the ๐ฆ loading with all its messages, then the function itself. I pointed out that I had made a syntax error: I missed the colon before the plus sign โโ, which caused an error
@jennybryan.bsky.social
27.01.2026 20:06 โ ๐ 4 ๐ 2 ๐ฌ 1 ๐ 0
I explained how the reprex workflow helps with collaboration ๐ค! I described a scenario where I ask a question about why the mean is 2.5 and not 3, and Nina wants to help by running the same code ๐ป. She can easily copy-paste the #reprex output and it actually runs โ
!
@jennybryan.bsky.social #RStats
27.01.2026 19:58 โ ๐ 3 ๐ 2 ๐ฌ 1 ๐ 0
I demonstrated how to use the #reprex package ๐ฆ! I loaded it and explained the workflow: I copied lines of code from my console (pretending Leo has a question โ), then ran the reprex() function without any arguments ๐ป.
@jennybryan.bsky.social #RStats
27.01.2026 19:49 โ ๐ 3 ๐ 2 ๐ฌ 1 ๐ 0
I explained how a common problem makes it harder for people to get help ๐ค. People often think they've provided everything needed - posting their code online ๐ป, showing how to recreate objects, sharing all the code they ran, and even including the output ๐.
@jennybryan.bsky.social #RStats #reprex
27.01.2026 19:44 โ ๐ 3 ๐ 2 ๐ฌ 1 ๐ 0
I explained the common problem people encounter when asking for coding help ๐ค. I showed how someone might copy code they ran, paste it, ask a question. I was setting up to explain what's wrong with this approach โ of sharing code for troubleshooting purposes๐ป #RStats #reprex @jennybryan.bsky.social
27.01.2026 19:41 โ ๐ 3 ๐ 2 ๐ฌ 0 ๐ 0
YouTube video by Leonardo Collado Torres
[2020-04-03] How to ask for R help by creating a small reproducible example (edited on 2026-01-27)
I re-edited this 2020 video using Adobe Premier Pro's features for removing filler words, pauses, and text-based editing. In total it went down from 20:53 min to 12:41. I hope that you get to (re-)enjoy it ^_^.
@jennybryan.bsky.social
#RStats #reprex #reproducibility #JHPCE
youtu.be/LfUFxHFhXTE
27.01.2026 17:00 โ ๐ 4 ๐ 3 ๐ฌ 0 ๐ 0
Lots has evolved since these R Stats sessions @libdrstats.bsky.social, but we touch on the hot topic of integrating LLMs into data science, mostly with Github Copilot, in a couple sessions--
Introductory look: youtu.be/xuCQZ5_5KB0
Reusable prompts: youtu.be/mlMl81xRI_M
#RStats #LLMs
16.01.2026 18:22 โ ๐ 9 ๐ 3 ๐ฌ 0 ๐ 0
Passionate about bioinformatics, computers, life sciences, knowledge discovery and exploration (with AI now!), always searching for meaning (hopefully not only in the wrong places).
https://github.com/gpertea https://www.linkedin.com/in/geo-pertea/
Neuroscience Graduate student in โช
@kr-maynard.bsky.socialโฌ group โชat the @lieberinstitute.bsky.socialโฌ
Unraveling the mysteries of the Locus Coeruleus
Spatial transcriptomics enthusiast
Mapping brain complexity, one dataset at a time!
Researcher in neuroscience and psychiatric genomics, University of Exeter
Investigator Lieber Institute for Brain Development
Prof @BUmedicine Pharmacology, Physiology & Biophysics|
Director NeuroGenomics & Translational Bioinformatics @McleanHospital
Associate member @broadinstitute.org
https://daskalakislab.mclean.harvard.edu/
Computational Biologist, Head of research @ebi.embl.org, part-time
Heidelberg University, codirector DREAM challenges. For group's activities, see @saezlab.bsky.social
Account of the Saez-Rodriguez lab at EMBL-EBI and Heidelberg University. We integrate #omics data with mechanistic molecular knowledge into #opensource #ML methods
Website: https://saezlab.org/
GitHub: https://github.com/saezlab/
PhD Candidate @CSHL | Genomic Sciences @LCG-UNAM | https://reneegf.github.io/ | she/her | ๐ฒ๐ฝ
Investigator @lieberinstitute.bsky.social/ Associate Professor of Psychiatry and Behavioral Sciences @johnshopkins.bsky.social + Associate Professor of Biological Psychology and Cognitive Neuroscience @UNIBA https://shorturl.at/VVhk5.
Statistician doing genomic data science, faculty the University of Chicago, Korean, Argentinean, American. Love kimchi, math, science, books with beautiful prose.
Ph.D. student, Biostatistics, @jhubiostat.bsky.social
she/her
Website: sparthib.github.io/blog
#rstats ๐ for genomics ๐งฌ ๐ป
views are my own
Biomedical data scientist working at the intersection of machine learning ๐ค, computational omics ๐งฌ, and population health ๐จโ๐ฉโ๐งโ๐ฆ
๐จโ๐ป Assistant professor @University of Utah
๐ Personal website: https://boyi-guo.com/
๐ค Pronounce: he/him
Comms professional. Black Lives Matter. LGBTQ+ ally. Science & medicine are real. Animals are cute.
Computer Scientist | Using transcriptomic data to understand RNA splicing changes in human disease | Research Associate at the @RytenLab #UKDRI Cambridge | But also interested in cats, music and astronomy ๐๐โโฌ๐
Specialist in High-Dimensional Biological Data Analysis and Multiomics Integration
Anesthesia resident, physician scientist
genomics & addiction neurobiology
๐ &๐ฑ dad