David Steinberg's Avatar

David Steinberg

@david4096.bsky.social

I make stuff to help scientists focus on their research david4096.github.io

204 Followers  |  1,443 Following  |  46 Posts  |  Joined: 10.01.2025  |  1.6733

Latest posts by david4096.bsky.social on Bluesky

Cloud-Based BRCA Exchange Variant Analysis Environment Using GA4GH Standards in Camber By integrating BRCA Exchange variant data with GA4GH standards, this GA4GH Implementation Forum (GIF) project creates open, platform-agnostic workflows and tools that can be used by anyone for scalabl...

Announcing GIF Project: Cloud-based BRCA Exchange variant analysis environment using GA4GH standards in Camber. The project aims to adapt and extend community-driven standards to support interoperable workflows, variant annotation, and metadata description. Learn more: www.ga4gh.org/what-we-do/g...

22.07.2025 13:46 β€” πŸ‘ 1    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Nice to meet you too!

04.04.2025 22:16 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Collected together @ga4gh.org Bluesky accounts here, lmk if you want to be added! go.bsky.app/8BDDMqM

03.04.2025 20:20 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Calling all @ga4gh.org Connect 2025 attendees online and in-person, let's connect here on bluesky! #ga4ghconnect2025 #ga4gh #bioinformatics #genomics

01.04.2025 13:59 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Capt. Grace Hopper on Future Possibilities: Data, Hardware, Software, and People (Part One, 1982)
YouTube video by National Security Agency Capt. Grace Hopper on Future Possibilities: Data, Hardware, Software, and People (Part One, 1982)

Grace Hopper could really get people laughing about information sciences and the struggles of working under strict hierarchies www.youtube.com/watch?v=si9i...

26.03.2025 21:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Summary of "Improvising to cellular playgrounds in Realtalk", Aug 2023
YouTube video by Dynamicland Summary of "Improvising to cellular playgrounds in Realtalk", Aug 2023

If you haven't caught up with the amazing new demos from @dynamicland.org now is your chance www.youtube.com/watch?v=Osn3...

12.03.2025 23:54 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

At the @mlcommons.org Croissant community meeting with, you guessed it

11.03.2025 17:50 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Improvising cellular playgrounds in Realtalk

The photo we saw reminded me immediately of some of the goals of @dynamicland.org as seen here dynamicland.org/2023/Improvi...

26.02.2025 16:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - dbcls/dive: Data Integration Visual Exploration (DIVE) Data Integration Visual Exploration (DIVE). Contribute to dbcls/dive development by creating an account on GitHub.

Another important direction is making immersive visual experiences that make data models accessible in a visual and humane way. I hope to experience this in person at a museum github.com/dbcls/dive

26.02.2025 16:40 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image Post image

Toshiyaki Katayama, original author of the wildly popular KEGG database rounding out the keynotes @swat4hcls.bsky.social by showing us the past, present, and future of linked data in the life sciences β€” lots of excitement for the possibilities of #graphgenome!!

26.02.2025 16:35 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Nice to see this one making the rounds @dockstore.org @ucscgenomics.bsky.social

25.02.2025 17:21 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Starter pack for #swat4hcls2025 conference go.bsky.app/PiZd2qR πŸ—£οΈ @swat4hcls.bsky.social

25.02.2025 17:09 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - MaastrichtU-IDS/UM_KEN4256_KnowledgeGraphs: Resources for the KG course at IDS, Maastricht University Resources for the KG course at IDS, Maastricht University - MaastrichtU-IDS/UM_KEN4256_KnowledgeGraphs

Slide from a course at @maastrichtu.bsky.social that’s up on GitHub github.com/MaastrichtU-...

25.02.2025 09:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Embedding knowledge graphs in order to compare ontologies using learned features from Shervin Mehryar’s keynote

25.02.2025 09:19 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

From Prof Anna Fensel’s keynote a roundup of some of the connections between AI and semantic

25.02.2025 09:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

One of the common themes of the conversations at #swat4hcls so far is that knowledge graphs are proving to be critical for reliability and interpretability of AI and LLMs in specific

25.02.2025 08:59 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Excited to attend #SWAT4HCLS in Barcelona next week, representing @cambercloud.bsky.social ! πŸŽ‰

At the hackathon, we’ll explore #CroissantML for seamless dataset & model access via @hf.co and @kaggle.com πŸ€“

22.02.2025 20:23 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Check out our first preprint from #biohacakathon Fukushima 2024 and expect more on this work πŸ€“ files.osf.io/v1/resources...

17.02.2025 22:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We found some low hanging fruit for improvement and tested out bringing a bio dataset into Croissant. We think that continually increasing the use of ontologies and controlled vocabularies will be crucial for data harmonization and the new era of multimodal models!

17.02.2025 22:02 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - david4096/croissant-rdf: Tools for working with RDF from Croissant JSON-LD resources Tools for working with RDF from Croissant JSON-LD resources - GitHub - david4096/croissant-rdf: Tools for working with RDF from Croissant JSON-LD resources

We made a simple tool for converting CroissantML to #RDF so it could be analyzed using #SPARQL and looked for differences between its usage between Kaggle and Hugging Face github.com/david4096/cr...

17.02.2025 22:02 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

It works by providing a controlled vocabulary for high level dataset metadata as well as specific metadata for columnar data, which might seem like a small thing but is a huge step forward for bringing tools to data

17.02.2025 22:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@hf.co , @kaggle.com , OpenML, DataVerse and others are all implementing some or part of the CroissantML spec that interoperates with tooling like Tensorflow so you can load datasets directly into your AI training code

17.02.2025 22:02 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Biology datasets tend to be messy, require domain knowledge to parse, and not immediately usable for training AI models. That’s part of why I became interested in @mlcommons.org CroissantML as a way to bring ML tools to biology data β€” we’re presenting a poster on this effort at #swat4hcls next week!

17.02.2025 21:54 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This is a great opportunity to contribute β€”

13.02.2025 08:05 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

@anthropic.com marked bioinformaticians as Office & Administrative for their job category 🧐 www.anthropic.com/news/the-ant...

13.02.2025 04:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

gestures in @worrydream.com

10.02.2025 21:46 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

My feed is full of people who are struggling to understand how they are going to continue their research and my heart goes out to everyone. Doing basic research is already hard enough

01.02.2025 19:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Yeah or computational/storage costs needed to reproduce results. That’s what piqued my interest

31.01.2025 16:53 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Appreciate your insights and tend to agree. I do like how they are trying to democratize publishing since I don’t think Elsevier’s of the world are adding much

31.01.2025 16:52 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Has anyone in my network tried @researchhubf.bsky.social ?

30.01.2025 06:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 3    πŸ“Œ 0

@david4096 is following 20 prominent accounts