Harit Vishwakarma @harit7 - Bluesky Profile

Latest posts by harit7.bsky.social on Bluesky

@srinathnamburi.bsky.social

11.12.2024 18:04 — 👍 1 🔁 0 💬 0 📌 0

Join us in the evening poster session (#1906) to learn more about it and chat about auto-labeling and data-centric AI.

Thanks to the amazing co-authors: Yi (Reid) Chen, Sui Jiet Tay, Srinath Namburi, @fredsala.bsky.social, Ramya Korlakai Vinayak.

11.12.2024 17:53 — 👍 2 🔁 0 💬 1 📌 0

Our method learns confidence functions tailored for efficient and reliable auto-labeling. Using these in TBAL boosts the no. of auto-labeled points by up to 60% (while making < 5% auto-labeling errors) compared to baselines like softmax and several training-time and post-hoc calibration techniques.

11.12.2024 17:53 — 👍 1 🔁 0 💬 1 📌 0

Introducing Colander, our framework for learning optimal confidence functions for TBAL! We formulate the auto-labeling objective as an optimization problem over the space of confidence functions and thresholds.

11.12.2024 17:53 — 👍 0 🔁 0 💬 1 📌 0
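A rough way to write the optimization this post describes. The notation is assumed here, since the post gives none: $g$ is a confidence function, $t$ a threshold, $h$ the trained classifier, and $\epsilon$ the tolerated auto-labeling error. The exact formulation in the paper (for example per-class thresholds or a smoothed surrogate) may differ.

```latex
\max_{g,\, t}\; \mathrm{coverage}(g, t)
\quad \text{subject to} \quad
\mathrm{error}(g, t) \le \epsilon,
\qquad \text{where} \quad
\mathrm{coverage}(g, t) = \Pr\big[\, g(x) \ge t \,\big],
\quad
\mathrm{error}(g, t) = \Pr\big[\, h(x) \ne y \;\big|\; g(x) \ge t \,\big].
```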

We systematically study the limitations of popular confidence functions like softmax outputs and off-the-shelf calibration techniques. The result? Too few auto-labeled points or large auto-labeling errors.

11.12.2024 17:53 — 👍 0 🔁 0 💬 1 📌 0

The choice of confidence function is crucial in TBAL – if it's not aligned with the auto-labeling objective, it can be detrimental to performance. We show that commonly used confidence functions fall short.

11.12.2024 17:53 — 👍 0 🔁 0 💬 1 📌 0

TBAL is a promising auto-labeling technique. It iteratively acquires human labels for small data chunks, trains a model, and auto-labels points where the model's confidence is above a threshold. The goal? Maximize coverage (proportion of auto-labeled points) with bounded auto-labeling error.

11.12.2024 17:53 — 👍 0 🔁 0 💬 1 📌 0
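The TBAL loop described in this post can be sketched as follows. This is a minimal illustration, not the authors' implementation: the nearest-centroid "model", the softmax-over-distances confidence, the random querying, and every function name (`fit_centroids`, `predict_with_confidence`, `pick_threshold`, `tbal`) are assumptions made for the example. The threshold is picked from a plain validation-set error estimate; a real system would likely use a more conservative estimate.

```python
import numpy as np

def fit_centroids(X, y, n_classes):
    # Toy stand-in for "train a model": one mean vector per class.
    # Falls back to the overall mean if a class has no labeled points yet.
    return np.stack([
        X[y == c].mean(axis=0) if np.any(y == c) else X.mean(axis=0)
        for c in range(n_classes)
    ])

def predict_with_confidence(centroids, X):
    # Softmax over negative squared distances; confidence is the max probability.
    d = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    logits = -d
    logits = logits - logits.max(axis=1, keepdims=True)
    p = np.exp(logits)
    p = p / p.sum(axis=1, keepdims=True)
    return p.argmax(axis=1), p.max(axis=1)

def pick_threshold(conf, correct, eps):
    # Smallest threshold whose validation error, among points at or
    # above it, is at most eps. Returns inf if no threshold qualifies.
    for t in np.sort(np.unique(conf)):
        mask = conf >= t
        if 1.0 - correct[mask].mean() <= eps:
            return t
    return np.inf

def tbal(X_pool, oracle, X_val, y_val, n_classes,
         eps=0.05, chunk=20, rounds=5, seed=0):
    rng = np.random.default_rng(seed)
    n = len(X_pool)
    remaining = np.ones(n, dtype=bool)   # points with no label yet
    human = np.zeros(n, dtype=bool)      # points labeled by the oracle
    auto = np.zeros(n, dtype=bool)       # points labeled by the model
    labels = np.full(n, -1)
    for _ in range(rounds):
        idx = np.flatnonzero(remaining)
        if idx.size == 0:
            break
        # 1. Acquire human labels for a small chunk.
        query = rng.choice(idx, size=min(chunk, idx.size), replace=False)
        labels[query] = oracle(query)
        human[query] = True
        remaining[query] = False
        # 2. Train a model on all human-labeled points.
        centroids = fit_centroids(X_pool[human], labels[human], n_classes)
        # 3. Choose a confidence threshold on held-out validation data.
        pred_val, conf_val = predict_with_confidence(centroids, X_val)
        t = pick_threshold(conf_val, pred_val == y_val, eps)
        # 4. Auto-label remaining points whose confidence clears the threshold.
        idx = np.flatnonzero(remaining)
        if idx.size == 0:
            break
        pred, conf = predict_with_confidence(centroids, X_pool[idx])
        keep = conf >= t
        labels[idx[keep]] = pred[keep]
        auto[idx[keep]] = True
        remaining[idx[keep]] = False
    return labels, auto

# Usage on synthetic two-class data (all values here are made up).
rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, size=600)
X = rng.normal(size=(600, 2)) + 3.0 * y_true[:, None]
X_pool, y_pool = X[:500], y_true[:500]
X_val, y_val = X[500:], y_true[500:]
labels, auto = tbal(X_pool, lambda i: y_pool[i], X_val, y_val, n_classes=2)
coverage = auto.mean()
error = (labels[auto] != y_pool[auto]).mean() if auto.any() else 0.0
print(f"coverage={coverage:.2f}, auto-labeling error={error:.3f}")
```

Swapping `predict_with_confidence` for a different scoring rule changes which points clear the threshold, which is where the choice of confidence function enters.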

Excited to present Colander at #NeurIPS2024, our new framework for optimizing confidence functions to make auto-labeling more efficient and reliable. Check out our poster #1906 at today's evening poster session.

Wed, Dec 11, 4:30–7:30 p.m., Poster #1906

Project: harit7.github.io/colander

11.12.2024 17:53 — 👍 4 🔁 2 💬 1 📌 0

@harit7 is following 15 prominent accounts

@icmlconf

Official account of ICML

Andreas Geiger
@andreasgeiger

Professor, University of Tübingen @unituebingen.bsky.social. Head of Department of Computer Science 🎓. Faculty, Tübingen AI Center 🇩🇪 @tuebingen-ai.bsky.social. ELLIS Fellow, Founding Board Member 🇪🇺 @ellis.eu. CV 📷, ML 🧠, Self-Driving 🚗, NLP 🖺

David Pfau
@davidpfau.com

So far I have not found the science, but the numbers keep on circling me. Views my own, unfortunately.

Xavier Alameda Pineda
@xavirema

Research Director @ Inria, Grenoble

Angeliki Giannou
@agg-gia

Gabe Orlanski
@gorlanski

PhD Student @ UW Madison

Jitian Zhao
@jtzhao

STAT PhD @ Wisc | Working on social network analysis & LLM adaptation

Jiayu (Mila) Wang
@jiayuwang

CS PhD @UW-Madison | Data- and compute-efficient reasoning for foundation models. Website: https://jiayuww.github.io/

Zachary Lipton
@zacharylipton

Cofounder & CTO @ Abridge, Raj Reddy Associate Prof of ML @ CMU, occasional writer, relapsing 🎷, creator of d2l.ai & approximatelycorrect.com

Mononito Goswami
@mononitogoswami

Ph.D. Student at Carnegie Mellon, Student Research at Google. Formerly Applied Science Intern Amazon, Undergrad at Delhi Technological University. 📈 Foundation Models for Structured Data (Time Series, Tabular), applications in healthcare.

Avi Trost
@atrost

PhD Student @UW-Madison, working on synthetic data, instruction tuning, and foundation models, @BrownUniversity '24 https://avitrost.github.io/

Fred Sala
@fredsala

Wisconsin CS. Snorkel AI. Working on machine learning & information theory. https://pages.cs.wisc.edu/~fredsala/

Nicholas Roberts
@nick11roberts

Ph.D. student at UW-Madison. Working on automating foundation model guided science. Previously at CMU, UCSD, Fresno City College. https://nick11roberts.science

Tzu-Heng (Brian) Huang
@zihengh1

CS Ph.D. Student @UWMadison. Research Intern @Apple AIML. Focusing on multimodal models, data curation, and data-centric AI. zihengh1.github.io

Bluesky
@bsky.app

official Bluesky account (check username👆) Bugs, feature requests, feedback: support@bsky.app