Florian Huber's Avatar

Florian Huber

@me-datapoint.bsky.social

Professor for data science at HSD, @zdd-hsd.bsky.social | ML fan & critic | current research mostly #datascience, #machinelearning, #cheminformatics #dataviz #nlp | โœจ #openscience #openaccess #rse | living data point ๐Ÿšฒ

2,105 Followers  |  602 Following  |  51 Posts  |  Joined: 08.09.2024  |  1.5248

Latest posts by me-datapoint.bsky.social on Bluesky

Preview
Bayerischer Landtag: Streit um Microsoft eskaliert Das bayerische Finanzministerium will weiterhin Microsoft-Produkte im Freistaat einsetzen und dafรผr einen millionenschweren Vertrag verlรคngern. Die Opposition will eine Abkehr vom Tech-Riesen und ford...

Das bayerische Finanzministerium will weiterhin Microsoft-Produkte einsetzen und dafรผr einen millionenschweren Vertrag verlรคngern. Die Opposition will eine Abkehr vom Tech-Riesen und fordert โ€ždigitale Souverรคnitรคtโ€œ. Doch auch das kรถnnte sich als Bumerang erweisen.

netzpolitik.org/2026/bayeris...

23.01.2026 14:56 โ€” ๐Ÿ‘ 106    ๐Ÿ” 33    ๐Ÿ’ฌ 16    ๐Ÿ“Œ 4

This is an excellent analogy, because my recollection from grade school is that the pen on the right looks fun and exciting, and then you play with it for a few minutes and realize it's not actually useful for anything and in fact makes some tasks more cumbersome, and never think about it again.

22.01.2026 12:21 โ€” ๐Ÿ‘ 10420    ๐Ÿ” 2699    ๐Ÿ’ฌ 190    ๐Ÿ“Œ 39
Preview
Abschied bis Herbst: Dรคnisches Digitalministerium kehrt Microsoft den Rรผcken Beim dรคnischen Digitalministerium sollen alle Angestellten ohne Microsoft auskommen. Stattdessen werde man Linux und LibreOffice nutzen, sagt die Ministerin.

The Danish Ministry of Digital Affairs is moving away from Microsoft and switching instead to Linux and LibreOffice
www.heise.de/news/Von-Wor...

20.01.2026 07:38 โ€” ๐Ÿ‘ 1354    ๐Ÿ” 512    ๐Ÿ’ฌ 33    ๐Ÿ“Œ 110

Considering I again did not receive the Nobel Prize for Economics, I no longer feel an obligation to pay back my debts.

19.01.2026 20:15 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Not my core expertise, but nonetheless also my (and most people's) business: Dependency on non-European digital services and infrastructure. Two days ago, the German Federal Ministry of the Inter... Not my core expertise, but nonetheless also my (and most people's) business: Dependency on non-European digital services and infrastructure. Two days ago, the German Federal Ministry of the Interior ...

I sometimes wonder how many more signs we (Europeans) need before we go all-in on European-based digital infrastructure.

Well, here is yet another report telling us why we shouldn't feel too comfortable when fully relying on US-based services:

www.linkedin.com/posts/f-hube...

#DigitalSovereignty

11.12.2025 09:16 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
#matchms #ms2query #ms2deepscore | Florian Huber I had an inspiring short trip to Wageningen University & Research at the end of November. First of all, to attend (and then celebrate!) the PhD defense of Niek de Jonge, and second, to join the Mini-S...

I had a fantastic short trip to @w-u-r.bsky.social at the end of November. First, to attend and celebrate the PhD defense of Niek de Jonge and second, to join the Mini-Symposium organized by @jjjvanderhooft.bsky.social !

See more on LinkedIn: www.linkedin.com/posts/f-hube...

09.12.2025 17:11 โ€” ๐Ÿ‘ 5    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Preview
xcms in Peak Form: Now Anchoring a Complete Metabolomics Data Preprocessing and Analysis Software Ecosystem High-quality data preprocessing is essential for untargeted metabolomics experiments, where increasing data set scale and complexity demand adaptable, robust, and reproducible software solutions. Modern preprocessing tools must evolve to integrate seamlessly with downstream analysis platforms, ensuring efficient and streamlined workflows. Since its introduction in 2005, the xcms R package has become one of the most widely used tools for LC-MS data preprocessing. Developed through an open-source, community-driven approach, xcms maintains long-term stability while continuously expanding its capabilities and accessibility. We present recent advancements that position xcms as a central component of a modular and interoperable software ecosystem for metabolomics data analysis. Key improvements include enhanced scalability, enabling the processing of large-scale experiments with thousands of samples on standard computing hardware. These developments empower users to build comprehensive, customizable, and reproducible workflows tailored to diverse experimental designs and analytical needs. An expanding collection of tutorials, documentation, and teaching materials further supports both new and experienced users in leveraging broader R and Bioconductor ecosystems. These resources facilitate the integration of statistical modeling, visualization tools, and domain-specific packages, extending the reach and impact of xcms workflows. Together, these enhancements solidify xcms as a cornerstone of modern metabolomics research.

Out now! xcms in Peak Form: Now Anchoring a Complete Metabolomics Data Preprocessing and Analysis Software Ecosystem doi.org/10.1021/acs....
with Phillipine and @jorainer.bsky.social (EURAC), @metabomichael.bsky.social, Hendrik and Norman from @ipbhalle.bsky.social, @janstanstrup.bsky.social, et al.

08.12.2025 20:26 โ€” ๐Ÿ‘ 25    ๐Ÿ” 10    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Preview
Digi-Health Heroes - die Zukunft digitaler Gesundheit | Zentrum fรผr Digitalisierung und Digitalitรคt (ZDD) Die ZZDenkanstรถรŸe gehen in die nรคchste Runde! Ab jetzt laden wir wieder wรถchtenlich dazu ein, vor Ort oder online, unsere Forschenden und Projekte am ZDD kennenzulernen und รผber Themen der Digitalen T...

Relaunch unserer Vortragsreihe "ZDDenkanstรถรŸe". Zum Start mit Sabrina GroรŸkopp --> www.linkedin.com/posts/zdd-du...

รœber die nรคchsten Wochen folgen weitere Vortrรคge: zdd-duesseldorf.de

Kommt gerne vorbei!

18.11.2025 16:25 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Super useful new feature! I've started to recommend plotnine to everybody who does data analysis in python and needs a plotting solution, so this is very welcome.

20.10.2025 16:37 โ€” ๐Ÿ‘ 14    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
We still canโ€™t predict much of anything in biology Biology is hard. Yes, even for AI.

Biology is much more complicated than most non-biologists can imagine. And AI is not going to change this anytime soon.
blog.genesmindsmachines.com/p/we-still-c...

07.10.2025 16:11 โ€” ๐Ÿ‘ 173    ๐Ÿ” 68    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 6

Impressive milestone by @europarl.europa.eu to ban "veggie-burger" and other great dangers to humanity. 100 millions of confused meat-eaters can now finally navigate the menus again.

08.10.2025 14:19 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
GitHub - matchms/matchms: Python library for processing (tandem) mass spectrometry data and for computing spectral similarities. Python library for processing (tandem) mass spectrometry data and for computing spectral similarities. - matchms/matchms

Special thanks to @julianpollmann.bsky.social and Niek de Jonge for code and code reviews!

GitHub: github.com/matchms/matc...

#opensource #RSE #researchsoftwareengineering

06.10.2025 15:59 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

New #matchms release (0.31)๐Ÿš€

With functionalities that were on our TODO list for a looooong time: Flash Entropy and BLINK scores! The new "FlashSimilarity" allows computing modified cosine, spectral entropy etc., about 100x faster (or more if you use Linux).

#Python #opensource #massspec

06.10.2025 15:59 โ€” ๐Ÿ‘ 6    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image Post image Post image Post image

Ready for the 4th International Summer ๐ŸŒž School on Non-Target Metabolomics at DTU - Technical University of Denmark #Copenhagen organized by Martin Hansen & Scott Jarmusch with a team of local and international helpers and instructors ๐Ÿ˜Ž
Thanks Lone Gram for opening the school ๐Ÿ™Œ
#CompMetabolomics

18.08.2025 07:33 โ€” ๐Ÿ‘ 14    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Please stop saying โ€œThe Tanimoto similarity isโ€ โ€“ RDKit blog A simple tip to explain what you actually did

Today's #RDKit blog post is a heartfelt plea for clearer communication.
greglandrum.github.io/rdkit-blog/p...

17.07.2025 11:22 โ€” ๐Ÿ‘ 32    ๐Ÿ” 7    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1

Great post!

We also noted the same thing, which triggered us to point out some pitfalls of various fingerprints --> www.biorxiv.org/content/10.1...

17.07.2025 11:40 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
BREAKING NEWS: AI coding may not be helping as much as you think Coding has been the strongest use case. But a new study from METR just dropped.

BREAKING NEWS: #AI coding may not be helping as much as you think

"But for now, the disconnnect between what coders thought they would get out of the tools efficiency-wise and what they actually did get out of them is cause for reevaluation." ~ @garymarcus.bsky.social

garymarcus.substack....

10.07.2025 23:13 โ€” ๐Ÿ‘ 38    ๐Ÿ” 11    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Preview
Paris cycling numbers double in one year thanks to massive investment and it's not stopping The report delves into the nuances of Parisian cycling culture, exploring the vibrant community of riders who navigate the city's streets

Paris cycling numbers double in one year thanks to massive investment and itโ€™s not stopping.
A visionary urban policy lead by @annehidalgo.bsky.social ๐Ÿ’ซ๐Ÿซถ๐Ÿป๐Ÿ™๐Ÿป
momentummag.com/paris-cyclin...

12.07.2025 08:26 โ€” ๐Ÿ‘ 386    ๐Ÿ” 122    ๐Ÿ’ฌ 7    ๐Ÿ“Œ 18

Hier in @duesseldorf.bsky.social wird vorerst lieber noch jeder Parkplatz verteidigt...

(und leider nicht nur hier)

03.07.2025 21:08 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I donโ€™t think anyone is prepared for what they just did w/ ICE.

This is not a simple budget increase. It is an explosion - making ICE bigger than the FBI, US Bureau of Prisons, DEA,& others combined.

It is setting up to make whatโ€™s happening now look like childโ€™s play. And people are disappearing.

03.07.2025 18:58 โ€” ๐Ÿ‘ 97348    ๐Ÿ” 37859    ๐Ÿ’ฌ 4446    ๐Ÿ“Œ 2658
Preview
Verwaltung der Digitalisierung gestalten: Neue Arbeitsgruppe startet! | D64 โ€“ Zentrum fรผr digitalen Fortschritt Wir grรผnden am 17. Juli 2025 eine neue Arbeitsgruppe zur Verwaltungsdigitalisierung. Hier bringen wir digitale Kompetenz und politische Gestaltung zusammen.

Hey Verwaltungs-Digitalisierer:innen! Am 17. Juli starten wir eine neue AG zur Verwaltungsdigitalisierung. Eure Expertise aus dem รถffentlichen Dienst ist gefragt! Gemeinsam gestalten wir die Zukunft der รถffentlichen, digitalen Verwaltung ๐Ÿ’ช

d-64.org/veranstaltun...

01.07.2025 11:38 โ€” ๐Ÿ‘ 3    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Preview
Effective data visualization strategies in untargeted metabolomics Covering: 2014 to 2023 for metabolomics, 2002 to 2023 for information visualization LC-MS/MS-based untargeted metabolomics is a rapidly developing research field spawning increasing numbers ofโ€ฆ

๐Ÿ”“Read in our MS Metabolomics themed collection, a #OpenAccess review from Kevin Mildau, Henry Ehlers, @jjjvanderhooft.bsky.social et al. at @w-u-r.bsky.socialโ€ฌ @tuwien.atโ€ฌ covering effective data visualization strategies in untargeted metabolomics #natprod

Find it here๐Ÿ”ฝ

26.06.2025 11:39 โ€” ๐Ÿ‘ 9    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Post image Post image Post image

@jorainer.bsky.social and @philouail.bsky.social gave a great overview of the ecosystem around #RforMassSpectrometry and #XCMS!

#MetSoc25
I am super glad they now also provide options to combine with #Python and #matchms (thanks๐Ÿ™)

26.06.2025 09:32 โ€” ๐Ÿ‘ 11    ๐Ÿ” 6    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿ“ข Poster 1001 at #MetSoc2025: Marilyn De Graeve on our #SpectriPy #rstats package to integrate #python and #rstats packages for #MassSpec data analysis . TODAY

23.06.2025 11:09 โ€” ๐Ÿ‘ 30    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Hi, in case your phone didn't pick up the QR code to the slides of my Hitch-Hikers Guide to Computational Metabolomics talk this morning at #Metabolomics2025, featuring #xcms, #massbank, not #metfrag but #CASMI and #MetFamily, please find them at doi.org/10.5281/zeno...

25.06.2025 09:15 โ€” ๐Ÿ‘ 16    ๐Ÿ” 8    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image Slide from presentation of Steffen Neumann

Slide from presentation of Steffen Neumann

Great keynote by @sneumann.bsky.social at #MetSoc25, strongly advocating for #opensource , data-sharing, and making things interoperable.

Glad to also spot #matchms in this universe :)

25.06.2025 07:35 โ€” ๐Ÿ‘ 18    ๐Ÿ” 4    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Post image

Proud of Niek de Jonge who did a fantastic job in presenting his work on cross-ion mode spectral similarity scoring! ๐Ÿ˜Ž ๐Ÿ‘
Work with Florian Huber @me-datapoint.bsky.social

#metabolomics #CompMetabolomics #MetSoc25 #MS2DeepScore

23.06.2025 21:46 โ€” ๐Ÿ‘ 20    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Chemical Space Visualizations using UMAP and various molecular fingerprints.

Chemical Space Visualizations using UMAP and various molecular fingerprints.

4/4
We also highlight options for count fingerprints, such as log-counts and IDF weighted counts. The latter can be used to adjust the bit importance to a dataset of your choice.

An example use-case are chemical space visualizations.

Preprint: www.biorxiv.org/content/10.1...

23.06.2025 09:22 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

3/4
A huge issue is bit collisions.
Fingerprints with a high bit occupation (RDKit, MAP4) often lead to (1) arbitrary misinterpretations, (2) shifts to high Tanimoto scores, (3) very different handling of small and large molecules.

--> Consider using sparse fingerprints!
--> Morgan >> MAP4 / RDKit

23.06.2025 09:22 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Benchmarking plot on fingerprint duplications.

Benchmarking plot on fingerprint duplications.

2/4
We focused on weaknesses of the fingerprints.
Many show frequent duplicates, so same fingerprint for different compounds. Most problematic: this can include *very* different compounds ending up with identical fingerprints.

- MAP4 >> Morgan-type >> daylight
- count >> binary

#cheminformatics

23.06.2025 09:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@me-datapoint is following 20 prominent accounts