LIBD rstats club's Avatar

LIBD rstats club

@libdrstats.bsky.social

Researchers at the @lieberinstitute.bsky.social learning, blogging & talking about #rstats, ๐Ÿ“ฆs, @Bioconductor.bsky.social, #bioinformatics (our own opinions) Admin: @lcolladotor.bsky.social http://lieberinstitute.github.io/rstatsclub/

245 Followers  |  92 Following  |  28 Posts  |  Joined: 20.09.2023  |  1.9834

Latest posts by libdrstats.bsky.social on Bluesky


Video thumbnail

Talk submission video for Posit Conf 2026 #RStats

ยซOrganizing your data science team to be efficient, adaptive to new developments, and resilient to turnover rateยป

URLs:
1. lcolladotor.github.io/bioc_team_ds/
2. research.libd.org/rstatsclub/
3. github.com/LieberInstit...

#OpenSource #Bioconductor

20.02.2026 21:18 โ€” ๐Ÿ‘ 10    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
RDM Weekly - Issue 031 A weekly roundup of Research Data Management resources.

Issue 31 of RDM Weekly is out! ๐Ÿ“ฌ

โžก๏ธ OS resources in Comm Sciences @csdisseminate.bsky.social
โžก๏ธ RDA 25th Plenary Meeting Programme @researchdataall.bsky.social
โžก๏ธ Ask for R Help by Creating a Small Reproducible Example @libdrstats.bsky.social
and more!

rdmweekly.substack.com/p/rdm-weekly...

03.02.2026 14:05 โ€” ๐Ÿ‘ 13    ๐Ÿ” 6    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
@lieberinstitute.bsky.social @jhubiostat.bsky.social

@lieberinstitute.bsky.social @jhubiostat.bsky.social

lcolladotor.github.io/rnaseq_LCG-U... is looking ready for next week!

We'll have the most guest presenters ever this year with @meli-mq99.bsky.social @gabrielramirezv.bsky.social @mariagtac.bsky.social & @daiannaglez.bsky.social ๐Ÿ™Œ๐Ÿฝ

See you soon at LCG-UNAM! ๐Ÿ‡ฒ๐Ÿ‡ฝ

#RStats @bioconductor.bsky.social

03.02.2026 05:21 โ€” ๐Ÿ‘ 8    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

In a recent journal club, I covered the SOFA paper by @wkhuber.bsky.social and colleagues. It's especially cool how you can guide the factorization approach with known covariates.

Check out their preprint (doi.org/10.1101/2024...) and our discussion (youtu.be/WKs-g4lXzZs)!

08.01.2026 18:00 โ€” ๐Ÿ‘ 7    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

A colleague Geo @geopertea.bsky.social found out a cool trick to get Positron working with full support at our computing cluster JHPCE (remote file explorer, Positron assistant seeing the remote R session). Game changer for my work at LIBD! Check out his demo video (youtu.be/CnmaJG8VPp8) #positron

27.01.2026 19:53 โ€” ๐Ÿ‘ 7    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

noted that DuckDB won't perform worse than dplyr at any point โœ…, and it's pretty easy to use ๐Ÿ‘, so he plans to keep using duckplyr more often ๐Ÿš€.

He wrapped up by saying the benchmarks were interesting ๐Ÿ“ˆ, and despite the mixed results, he still expects to be using duckplyr a lot going forward! ๐Ÿ’ช๐Ÿ˜Š

28.01.2026 02:59 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

9 hours โฐ๐Ÿ˜ซ, and he was trying to optimize it.

Surprisingly, this particular use case didn't benefit much from DuckDB ๐Ÿฆ†, leading to his main takeaway: performance is highly dependent on what you're doing ๐ŸŽฏ. However, he

28.01.2026 02:59 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

@nick-eagles.bsky.social concluded that scratch space location matters significantly for performance ๐Ÿ’พโšก. He explained his motivation: he had a real use case with hundreds of millions of rows ๐Ÿ“Š that originally took

Full length video: youtu.be/ikumz_QLZiE
#RStats #dplyr #duckplyr #rpolars #data.table

28.01.2026 02:59 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

He noted that most benchmarks focus on speed โšก, but emphasized that memory efficiency ๐Ÿง  is another important variable to consider - particularly highlighting that data.table tends to be more memory efficient than dplyr ๐Ÿ’พโœจ.

#RStats #dplyr #duckplyr #rpolars #data.table

28.01.2026 02:50 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

@nick-eagles.bsky.social introduced his presentation on benchmarking #dplyr alternatives ๐Ÿ“Š๐Ÿ’ป. He planned to review public benchmarks of various data manipulation packages ๐Ÿ“ฆ as well as his own custom benchmark results ๐Ÿ”ฌ

Full length ๐ŸŽฅ: youtu.be/ikumz_QLZiE
#RStats #dplyr #duckplyr #rpolars #data.table

28.01.2026 02:50 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

actually be more rigorous ๐Ÿ”โœ…, as he was trying to argue with the AI model ๐Ÿค–๐Ÿ’ฌ.

He wrapped up by saying "that's it" ๐ŸŽฌ, and then @nick-eagles.bsky.social enthusiastically responded that this is super useful ๐ŸŒŸ and they're probably going to switch over to using Positron today! ๐Ÿš€๐Ÿ˜„

#RStats @posit.co

27.01.2026 21:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

@geopertea.bsky.social discussed how the AI provides semantic understanding of code ๐Ÿง ๐Ÿ’ป, showing how objects are generated from different parts of the codebase ๐Ÿ”—. He acknowledged that you can do this manually too ๐Ÿ‘จโ€๐Ÿ’ป, and sometimes manual inspection might ...

#RStats #PositronIDE @posit.co

27.01.2026 21:22 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

He pointed out - why not use AI to do that for you instead? ๐Ÿค–๐Ÿ’ก The AI quickly gave him the context of where it happens, functioning like a smart search tool โšก๐ŸŽฏ!

#RStats

27.01.2026 21:12 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

He described another helpful scenario: tracing how a "log counts assay" object was generated ๐Ÿ”ฌ. Normally, you'd search for the filename ๐Ÿ”Ž, then have to manually eyeball the context ๐Ÿ‘€ to see if that object was being read or written in a particular file ๐Ÿ“„.

#RStats

27.01.2026 21:12 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

@geopertea.bsky.social showed his chat history ๐Ÿ“œ and explained a useful use case: he asked the AI to look into a repository ๐Ÿ” and tell him what methods they were using for calculations ๐Ÿงฎ. Basically, he had the AI read the code and summarize it for him ๐Ÿ“โœจ.

#RStats #Positron #PositronAI

27.01.2026 21:12 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Once everyone agrees on the plan โœ…, the idea is to switch to agent mode ๐Ÿ”„ so it has access to actually modify the code according to that plan ๐Ÿ“โœจ. That's the intended workflow! ๐ŸŽฏ

#RStats #Positron

27.01.2026 21:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

He explained the workflow: when you start modifying files or code ๐Ÿ’ป, it should switch to a different agent - either a specific agent or the main agent โšก. The planning agent is good for creating plans through discussion ๐Ÿ’ฌ between the LLM in the chat and the user ๐Ÿ‘ฅ.

#RStats #Positron

27.01.2026 21:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

@geopertea.bsky.social explained how you can add your own custom agents ๐Ÿค– to #Positron & showed the interactive way to configure them ๐Ÿ› ๏ธ. He demonstrated how the "plan" agent is defined, noting that it doesn't use any file mod tools ๐Ÿ“โŒ - it's specifically designed for planning purposes ๐Ÿ“‹

#RStats

27.01.2026 21:04 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

authentication code ๐Ÿ”‘ which he pasted into the authorization page ๐ŸŒ. He had to authorize the application to access his GitHub account โœ…, going through two-factor authentication on his phone ๐Ÿ“ฑ. Once approved, he returned to Positron which confirmed the GitHub account was successfully connected! ๐ŸŽ‰โœจ

27.01.2026 20:54 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

@geopertea.bsky.social demonstrated setting up #Positron IDE ๐Ÿ’ป by first detecting local interpreters and updating extensions ๐Ÿ”ง. He showed how to login to @github.com to activate GitHub Copilot connectivity ๐Ÿค– for the Positron assistant. During the new installation, the app provides an...

#RStats

27.01.2026 20:54 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

I explained why including context is crucial ๐Ÿ“š! I pointed out that someone could just paste code like "GRanges from 1 to 1000 plus..." but then I would be confused ๐Ÿ˜• asking "where does this function come from? What package is this from?" ๐Ÿ“ฆโ“

@jennybryan.bsky.social #RStats #reprex

27.01.2026 20:16 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

I demonstrated running a #reprex in a fresh #RStats session ๐Ÿ’ป & opened it in my browser ๐ŸŒ. It showed the ๐Ÿ“ฆ loading with all its messages, then the function itself. I pointed out that I had made a syntax error: I missed the colon before the plus sign โž•โŒ, which caused an error

@jennybryan.bsky.social

27.01.2026 20:06 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

I explained how the reprex workflow helps with collaboration ๐Ÿค! I described a scenario where I ask a question about why the mean is 2.5 and not 3, and Nina wants to help by running the same code ๐Ÿ’ป. She can easily copy-paste the #reprex output and it actually runs โœ…!

@jennybryan.bsky.social #RStats

27.01.2026 19:58 โ€” ๐Ÿ‘ 3    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

I demonstrated how to use the #reprex package ๐Ÿ“ฆ! I loaded it and explained the workflow: I copied lines of code from my console (pretending Leo has a question โ“), then ran the reprex() function without any arguments ๐Ÿ’ป.

@jennybryan.bsky.social #RStats

27.01.2026 19:49 โ€” ๐Ÿ‘ 3    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

I explained how a common problem makes it harder for people to get help ๐Ÿ˜ค. People often think they've provided everything needed - posting their code online ๐Ÿ’ป, showing how to recreate objects, sharing all the code they ran, and even including the output ๐Ÿ“Š.

@jennybryan.bsky.social #RStats #reprex

27.01.2026 19:44 โ€” ๐Ÿ‘ 3    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

I explained the common problem people encounter when asking for coding help ๐Ÿค”. I showed how someone might copy code they ran, paste it, ask a question. I was setting up to explain what's wrong with this approach โŒ of sharing code for troubleshooting purposes๐Ÿ’ป #RStats #reprex @jennybryan.bsky.social

27.01.2026 19:41 โ€” ๐Ÿ‘ 3    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
[2020-04-03] How to ask for R help by creating a small reproducible example (edited on 2026-01-27)
YouTube video by Leonardo Collado Torres [2020-04-03] How to ask for R help by creating a small reproducible example (edited on 2026-01-27)

I re-edited this 2020 video using Adobe Premier Pro's features for removing filler words, pauses, and text-based editing. In total it went down from 20:53 min to 12:41. I hope that you get to (re-)enjoy it ^_^.

@jennybryan.bsky.social

#RStats #reprex #reproducibility #JHPCE

youtu.be/LfUFxHFhXTE

27.01.2026 17:00 โ€” ๐Ÿ‘ 4    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Lots has evolved since these R Stats sessions @libdrstats.bsky.social, but we touch on the hot topic of integrating LLMs into data science, mostly with Github Copilot, in a couple sessions--

Introductory look: youtu.be/xuCQZ5_5KB0
Reusable prompts: youtu.be/mlMl81xRI_M

#RStats #LLMs

16.01.2026 18:22 โ€” ๐Ÿ‘ 9    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Keri Martinowich, Ph.D., Chief Scientific Officer | The Lieber Institute for Brain Development | Research & Discovery Lieber Institute for Brain Development Appoints Keri Martinowich, Ph.D., as Chief Scientific Officer Baltimore, MD (January 6, 2025) โ€” The Lieber Institute for Brain Development (LIBD) announced the a...

Honored to be appointed Chief Scientific Officer @lieberinstitute.bsky.social. LIBD is a uniquely collaborative place with extraordinary biological resources and a commitment to translation ๐Ÿง ๐Ÿ”ฌ๐Ÿงช. Grateful to my colleagues and excited to help guide our next chapter!
www.libd.org/keri-martino...

09.01.2026 13:07 โ€” ๐Ÿ‘ 205    ๐Ÿ” 25    ๐Ÿ’ฌ 46    ๐Ÿ“Œ 3
Post image Post image Post image

PS You can see Cynthia's growth on @github.com by comparing the number of commits per year in 2022 (before joining @lieberinstitute.bsky.social), then the end of 2023, to 2025 (she spent nearly all year with us) ๐Ÿ“ˆ

17.12.2025 23:32 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@libdrstats is following 20 prominent accounts