Forbes 30 Under 30 2025: Healthcare & Science
Discovering new worlds, in our cells and outer space.
Honored to be named to the Forbes 30 Under 30 Asia 2025 in Science!
Grateful for the recognition of my Ph.D. work on Retrieval-Augmented LMs, and excited to keep pushing the boundaries of reliable and efficient language models.
forbes.com/30-under-30/...
More updates soon…
16.05.2025 14:02
Sad to miss #ICLR2025 this year, but my amazing co-authors will be there in person to present Pangea!
neulab.github.io/Pangea/
I'll be at the Foundation Models for Science conference at Simons Foundation, NYC next week, then heading to NAACL (more details soon).
Let's catch up if you're around!
22.04.2025 00:42
Real user queries often look different from the clean, concise ones in academic benchmarks: ambiguous, full of typos, and much less readable.
We show that even strong RAG systems quickly break under these conditions.
Awesome project led by
@neelbhandari.bsky.social and @tianyucao.bsky.social!!
22.04.2025 00:27
31% of US adults use generative AI for healthcare. But most AI systems answer questions assertively, even when they don't have the necessary context. Introducing #MediQ, a framework that enables LLMs to recognize uncertainty and ask the right questions when info is missing: 🧵
06.12.2024 22:51
The 2nd Workshop on Regulatable ML @NeurIPS2024
Towards Bridging the Gaps between Machine Learning Research and Regulations
CopyBench (EMNLP 2024, led by @tomchen0112.bsky.social)
Oral at regulatableml.github.io & Poster at redteaming-gen-ai.github.io
tldr: We benchmarked LLMs' literal/non-literal copying of copyrighted content; risks found even in 8B models.
Details: www.arxiv.org/abs/2407.07087
08.12.2024 02:55
MassiveDS (led by @rulinshao.bsky.social): Wednesday poster, 11 am-2 pm, West Ballroom #7203
TLDR: We demonstrated that scaling the retrieval corpora of Retrieval-Augmented LMs to 1.4T tokens helps and achieves more compute-optimal scaling
Details: retrievalscaling.github.io
08.12.2024 02:54
Excited to attend #NeurIPS2024 in person! I'll be presenting MassiveDS and CopyBench. Details below 🧵
Let's catch up and chat about:
- LLMs & Retrieval-Augmented/Augmented LMs
- LLM Applications for science (e.g., OpenScholar) & others
- Ph.D./faculty apps
...and more!
08.12.2024 02:52
Oh that's a screenshot of my website. Here's a link to my CV: akariasai.github.io/assets/pdf/a...
06.12.2024 04:34
Akari Asai
A 5th year Ph.D. student at University of Washington, focusing on NLP and ML.
I would love to hear about any opportunities that might be a good fit!! You can find my contact info and CV on my website: akariasai.github.io. I am attending NeurIPS in person so let's chat!
04.12.2024 13:31
Recognition & Impact: My work has earned EECS Rising Stars 2022, the MIT Tech Review Innovator Award (Japan 2024), paper awards at ACL & NeurIPS, and the IBM Fellowship. It has been featured in media outlets such as MIT Tech Review, Forbes, and VentureBeat.
04.12.2024 13:31
Ai2 OpenScholar: Scientific literature synthesis with retrieval-augmented language models | Ai2
Ai2's & UW's OpenScholar, a retrieval-augmented LM, helps scientists navigate and synthesize scientific literature.
Making Real-World Impacts
Retrieval-Augmented LMs tackle critical challenges like:
1. Unreliable LMs in expert domains
2. Information access inequity across languages
I launched OpenScholar for scientific synthesis: 20k+ demo requests in week 1! Details: allenai.org/blog/opensch...
04.12.2024 13:30
Self-RAG: Learning to Retrieve, Generate and Critique through Self-Reflection
Building the Foundations:
Retrieval-augmented LMs need more than off-the-shelf models. I developed advanced training/inference algorithms & architectures, including Self-RAG (ICLR 2024 Oral; NeurIPS Workshop Hon. Mention) for adaptive retrieval & self-critique.
Learn more:
selfrag.github.io
04.12.2024 13:30
I'm on the academic job market this year! I'm completing my @uwcse.bsky.social @uwnlp.bsky.social Ph.D. (2025), focusing on overcoming LLM limitations like hallucinations, by building new LMs.
My Ph.D. work focuses on Retrieval-Augmented LMs to create more reliable AI systems 🧵
04.12.2024 13:26
congrats @akariasai.bsky.social:
- retrieval augmented LM for science literature
- open data, weights, index, code, etc
- new eval suite for science literature tasks
- demo to play w the model
encourage checking out to see what scientific LMs can/can't do today w open research artifacts
19.11.2024 16:58
Super exciting RAG prototype @akariasai.bsky.social built on top of Semantic Scholar!
I love how it returns competent research answers for seemingly out-of-CS-domain questions, e.g. "what's a bell?" openscholar.allen.ai/query/69cf13...
it's good in domain too
19.11.2024 16:59
A photo of Boulder, Colorado, shot from above the university campus and looking toward the Flatirons.
I'm recruiting 1-2 PhD students to work with me at the University of Colorado Boulder! Looking for creative students with interests in #NLP and #CulturalAnalytics.
Boulder is a lovely college town 30 minutes from Denver and 1 hour from Rocky Mountain National Park
Apply by December 15th!
19.11.2024 10:38
8/ Acknowledgements:
OpenScholar is the result of a collaborative effort across UW, Ai2, and many others!
Huge thanks to our incredible team, including experts from CS, Bio, and Physics, for making this possible!
We'd love your feedback! Reply or email us with questions, ideas, or use cases
19.11.2024 16:33
Ai2 OpenScholar
8/ Summary
Try it out: openscholar.allen.ai
Read more: allenai.org/blog/opensch... (we discuss more details as well as limitations of OpenScholar, based on our beta testing with CS researchers)
Code & data: github.com/AkariAsai/Op...
Paper: openscholar.allen.ai/paper
19.11.2024 16:33
7/ What's next?
We're just getting started with OpenScholar!
Expanding domains: Support for non-CS fields is coming soon.
Public API: Full-text search over 45M+ papers will be available shortly.
Try the OpenScholar demo and share your feedback!
openscholar.allen.ai
19.11.2024 16:33
6/ Open Access [2]:
- OpenScholar Datastore (45M+ papers up to 2024/10): huggingface.co/datasets/Ope...
- ScholarQABench: github.com/AkariAsai/Sc...
- Human evaluation interface: github.com/AkariAsai/Op...
19.11.2024 16:33
6/ Open Access [1]:
Prior work in this area has relied on proprietary LMs and/or released only a subset of the datastore.
We're releasing:
- Demo: openscholar.allen.ai
- Code & model checkpoints:
github.com/AkariAsai/Op...
huggingface.co/collections/...
19.11.2024 16:33
5/ Expert Evaluation Results:
We further conduct expert evaluations with scientists across CS, Bio and Physics, comparing OpenScholar against expert answers.
Scientists preferred OpenScholar-8B outputs over human-written answers the majority of the time, thanks to their coverage
19.11.2024 16:33
5/ Automatic Results:
So how good is OpenScholar?
On ScholarQABench, OpenScholar-8B surpassed GPT-4o, concurrent PaperQA2, and other models in factuality & citation accuracy despite being many times cheaper!
19.11.2024 16:33
4/ New dataset: ScholarQABench
A benchmark for evaluating scientific language models on real-world, open-ended questions requiring synthesis across multiple papers.
- 7 datasets across four scientific disciplines
- 2,000+ expert-annotated questions and 200 answers
- Automated metrics
19.11.2024 16:33
3/ What is OpenScholar?
It's a retrieval-augmented LM with:
1. a datastore of 45M+ open-access papers
2. a specialized retriever and reranker to search the datastore
3. a fine-tuned 8B Llama LM trained on high-quality synthetic data
4. a self-feedback generation pipeline
19.11.2024 16:33
2/ On the shoulders of giants
With millions of papers published yearly, keeping up with scientific literature has become a monumental challenge. OpenScholar aims to help researchers navigate this vast landscape by synthesizing grounded, citation-supported answers from academic papers.
19.11.2024 16:33
1/ Introducing OpenScholar: a retrieval-augmented LM to help scientists synthesize knowledge
@uwnlp.bsky.social & Ai2
With open models & a 45M-paper datastore, it outperforms proprietary systems & matches human experts.
Try out our demo!
openscholar.allen.ai
19.11.2024 16:30
Assistant Professor at @cs.ubc.ca and @vectorinstitute.ai working on Natural Language Processing. Book: https://lostinautomatictranslation.com/
Postdoc @LTIatCMU. PhD from Ohio State @osunlp. Author of MMMU, MAmmoTH. Training & evaluating foundation models. Previously @MSFTResearch. Opinions are my own.
Researcher trying to shape AI towards positive outcomes. ML & Ethics +birds. Generally trying to do the right thing. TIME 100 | TED speaker | Senate testimony provider | Navigating public life as a recluse.
Former: Google, Microsoft; Current: Hugging Face
Book: https://thecon.ai
Web: https://faculty.washington.edu/ebender
Assistant professor at https://si.umich.edu/ working in computational social science, machine learning, and NLP | https://dallascard.github.io
AI safety at Anthropic, on leave from a faculty job at NYU.
Views not employers'.
I think you should join Giving What We Can.
cims.nyu.edu/~sbowman
https://najoung.kim
langauge
ai research @ thinking machines . realtime video+voice. i like trains and bikes. sometimes I climb rocks and throw pottery.
Assoc. Prof in CS @ Northeastern, NLP/ML & health & etc. He/him.
AI @ OpenAI, Tesla, Stanford
I lead Cohere For AI. Formerly Research
Google Brain. ML Efficiency, LLMs,
@trustworthy_ml.
Director, MIT Computational Psycholinguistics Lab. President, Cognitive Science Society. Chair of the MIT Faculty. Open access & open science advocate. He.
Lab webpage: http://cpl.mit.edu/
Personal webpage: https://www.mit.edu/~rplevy
He teaches information science at Cornell. http://mimno.infosci.cornell.edu
Professor, Santa Fe Institute. Research on AI, cognitive science, and complex systems.
Website: https://melaniemitchell.me
Substack: https://aiguide.substack.com/
Assistant Professor the Polaris Lab @ Princeton (https://www.polarislab.org/); Researching: RL, Strategic Decision-Making+Exploration; AI+Law
Associate prof at @UMich in SI and CSE working in computational social science and natural language processing. PI of the Blablablab blablablab.si.umich.edu
PhD Student @MIT | Previous @allen_ai | #NLP #HCI | www.szj.io
Thinking about content moderation, equity, machine learning, and natural language processing.
Now: Chancellor's Fellow (~Asst. Prof) @technomoralfutures.bsky.social @edinburgh-uni.bsky.social
Past: MBZUAI, SFU, Uni of. {Sheffield, CPH}
Associate Professor (Linguistics) at University of Washington
https://shane.st