Benjamin Feuer

@benjaminfeuer.bsky.social

PhD researcher at NYU, working on LLMs, VLMs, and tabular foundation models from a data-centric perspective. Father of two, NYC diehard.

84 Followers  |  58 Following  |  23 Posts  |  Joined: 20.11.2024

Latest posts by benjaminfeuer.bsky.social on Bluesky

Special thanks to the BlueSky DCVLR crew: @yuhuiz.bsky.social @thaottn.bsky.social @vishaalurao.bsky.social @saining.bsky.social @sarameghanbeery.bsky.social

18.06.2025 14:27 | 👍 2    🔁 1    💬 0    📌 0
DCVLR: Data Curation for Vision Language Reasoning - NeurIPS 2025 Competition Join the DCVLR NeurIPS 2025 Competition. Advance visual reasoning in VLMs through data curation.

Check out:

Our Website: dcvlr-neurips.github.io

Our Starter Kit (Curate, Train, Eval): github.com/oumi-ai/oumi...

🧵 6 / n

18.06.2025 14:22 | 👍 2    🔁 0    💬 1    📌 0

* A submission = a curated reasoning dataset on @huggingface with 1k or 10k samples and a scalable, reproducible curation strategy you document in a write-up
* You don't need to train a model
* You can submit with nothing more than a free Colab or Kaggle account for basic testing

🧵 5 / n

18.06.2025 14:22 | 👍 2    🔁 0    💬 1    📌 0

💪 anyone can compete for free 💪: Thanks to our sponsor @LambdaAPI we offer three free submissions for up to 500 teams. This is unprecedented in data-centric research, which tends to be very expensive because you have to train lots of models!

🧵 4 / n

18.06.2025 14:21 | 👍 2    🔁 0    💬 1    📌 0

🤖 open-models 🤖: every model we present results for will have open weights, and one of those models will be Molmo-O from @allen_ai (a recent best paper honorable mention from @cvpr at #CVPR2025), trained on open data.

🧵 3 / n

18.06.2025 14:20 | 👍 2    🔁 0    💬 1    📌 0

DCVLR is data-centric: we train an ~7B VLM on your dataset. The best performer (on benchmarks like MathVista, VMCBench and LiveXiv) will be eligible to win $1500 and a talk at #NeurIPS2025!

We also have a few twists compared to prior data-centric competitions:

🧵 2 / n

18.06.2025 14:20 | 👍 2    🔁 0    💬 1    📌 0

So excited to announce the DCVLR (Data Curation for Vision-Language Reasoning) competition at #NeurIPS2025, led by @oumi-pbc.bsky.social and Lambda AI!

🌟 open-data 🌟
🤖 open-models 🤖
💻 open-source 💻
💪 anyone can compete for free 💪

dcvlr-neurips.github.io

🧵 1 / n

18.06.2025 14:18 | 👍 6    🔁 3    💬 1    📌 2

Co-organizing with wonderful collaborators from MIT, NYU, Stanford and UW: @thaottn.bsky.social, @sewoong79.bsky.social, @sarameghanbeery.bsky.social, @yuhuiz.bsky.social!

01.05.2025 17:04 | 👍 2    🔁 1    💬 0    📌 0

We are excited to be sponsored by @datologyai.com, who will be providing prizes for best paper awards 🏆

01.05.2025 17:02 | 👍 0    🔁 0    💬 0    📌 0

🚀 We welcome any submission that discusses domain-specific data curation pipelines and/or generalizable curation principles, getting us closer to building data-centric methods that are robust, efficient, and adaptable across domains.

Refer to our website for the call for papers!

01.05.2025 17:02 | 👍 0    🔁 0    💬 0    📌 0
ICML 2025 Workshop on Unifying Data Curation Frameworks Across Domains

📢 Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains!

📅 Deadline: May 24, AoE
🔗 Website: dataworldicml2025.github.io

We have an amazing lineup of speakers + panelists from various institutions and application areas!

01.05.2025 17:01 | 👍 2    🔁 1    💬 3    📌 2

That's not what they did. They used GPT-4o for program synthesis, which is fundamentally different from asking the LLM to provide the correct response in the prompt.

22.12.2024 11:06 | 👍 0    🔁 0    💬 1    📌 0

Thanks for sharing! FWIW, I sensed mostly optimism and excitement at NeurIPS -- the people I spoke to were eager to talk about their research and learn about mine. Let's meet up in the new year and compare notes @kyunghyuncho.bsky.social

22.12.2024 11:02 | 👍 1    🔁 0    💬 0    📌 0

That does seem like a sound rule! Although, interestingly, they did not apply it to me. 😅

14.12.2024 15:40 | 👍 0    🔁 0    💬 0    📌 0

Hello Vancouver!

11.12.2024 16:19 | 👍 1    🔁 0    💬 0    📌 0

or AI for science!

baskargroup.github.io/BioTrove/

08.12.2024 21:16 | 👍 0    🔁 0    💬 0    📌 0
Ben Feuer on LinkedIn: GitHub - penfever/TuneTables: TuneTables is a tabular classifier that… Very happy to report that our paper describing TuneTables, a new tabular classification and regression model, will appear at #NeurIPS 2024! 🎊 Built on the…

Or tabular deep learning ...

www.linkedin.com/posts/benjam...

08.12.2024 21:16 | 👍 0    🔁 0    💬 1    📌 0
Statistics in LLMs - Schedule Saturday, December 14th, 2024

Or LLMs ...

sites.google.com/berkeley.edu...

08.12.2024 21:16 | 👍 0    🔁 0    💬 1    📌 0
NeurIPS Poster: ImageNet++: A Large-Scale Benchmark of Data Curation Strategies (NeurIPS 2024)

NeurIPS folks, excited to connect next week at the conference!

HMU to talk about VLMs ...

neurips.cc/virtual/2024...

08.12.2024 21:16 | 👍 0    🔁 0    💬 1    📌 0

Unpopular opinion: the #ICLR2025 reviews were better quality than in the last few years.

I think it's mainly because they had people review fewer papers.

Opinions?

28.11.2024 14:02 | 👍 1    🔁 0    💬 1    📌 0
The Righteous Mind: Why Good People Are Divided by Politics and Religion [Haidt, Jonathan] on Amazon.com

This book helped me learn how to understand ideological and inconsistent intellectual stances (a bit)

www.amazon.com/Righteous-Mi...

28.11.2024 14:00 | 👍 0    🔁 0    💬 0    📌 0

I feel like my MacBook Pro battery is starting to go; it used to last all day, now it's dead by the afternoon. The thing is only 2.5 years old. 🤨

27.11.2024 11:45 | 👍 0    🔁 0    💬 0    📌 0

Excited to be making my first post on BlueSky! Let's talk AI research.

@eugenevinitsky.bsky.social, can I get a who's who on here? :-)

20.11.2024 10:50 | 👍 1    🔁 0    💬 0    📌 0

@benjaminfeuer is following 20 prominent accounts