
Manuel

@chasinggradients.bsky.social

NLP - mostly representation learning #NLP #NLProc

65 Followers  |  315 Following  |  11 Posts  |  Joined: 29.10.2023

Latest posts by chasinggradients.bsky.social on Bluesky


I just released Sentence Transformers v4.1, featuring ONNX and OpenVINO backends for rerankers (2-3x speedups) and improved hard negative mining, which helps prepare stronger training datasets.

Details in 🧵

15.04.2025 13:54 · 👍 11  🔁 4  💬 1  📌 0
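As a rough illustration of the reranker backends mentioned in the post above, here is a minimal sketch, assuming sentence-transformers >= 4.1 with the ONNX extras installed; the model name is just a commonly used public reranker standing in for whatever model you actually serve, and the `backend` keyword follows the library's documented backend pattern, so treat the details as assumptions if your version differs. The improved hard negative mining lives in `sentence_transformers.util.mine_hard_negatives` and is used separately when building training data.

```python
# Minimal sketch: loading a reranker (CrossEncoder) with the ONNX backend.
# Assumes: pip install "sentence-transformers[onnx]>=4.1"
from sentence_transformers import CrossEncoder

# backend can be "torch" (default), "onnx", or "openvino" in recent releases
model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2", backend="onnx")

query = "How many people live in Berlin?"
passages = [
    "Berlin has a population of roughly 3.7 million people.",
    "Berlin is well known for its museums and nightlife.",
]

# Score (query, passage) pairs; higher scores indicate higher relevance
scores = model.predict([(query, p) for p in passages])
print(scores)
```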
ALT: a cat holding a sign that says help

πŸ—£οΈCall for emergency reviewers

I am serving as an AC for #ICML2025, seeking emergency reviewers for two submissions

Are you an expert of Knowledge Distillation or AI4Science?

If so, send me DM with your Google Scholar profile and OpenReview profile

Thank you!

20.03.2025 05:25 · 👍 2  🔁 1  💬 0  📌 1

We've just released MMTEB, our multilingual upgrade to the MTEB Embedding Benchmark!

It's a huge collaboration between 56 universities, labs, and organizations, resulting in a massive benchmark covering 1000+ languages, 500+ tasks, and more than a dozen domains.

Details in 🧵

21.02.2025 15:06 · 👍 23  🔁 4  💬 2  📌 0
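For readers who want to try the benchmark, here is a minimal sketch using the mteb package together with a SentenceTransformer model; the task names and model are placeholders chosen for speed, not part of the announcement, and the exact MMTEB benchmark selection in your mteb version may be named differently.

```python
# Minimal sketch: evaluating an embedding model on a small MTEB task subset.
# Assumes: pip install mteb sentence-transformers
import mteb
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")

# Pick a couple of tasks by name; the full multilingual benchmark is much larger.
tasks = mteb.get_tasks(tasks=["Banking77Classification", "STSBenchmark"])
evaluation = mteb.MTEB(tasks=tasks)

# Results (per task and aggregate scores) are written to the output folder.
results = evaluation.run(model, output_folder="results/all-MiniLM-L6-v2")
```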
Can Cross Encoders Produce Useful Sentence Embeddings? Cross encoders (CEs) are trained with sentence pairs to detect relatedness. As CEs require sentence pairs at inference, the prevailing view is that they can only be used as re-rankers in information r...

Can Cross Encoders Produce Useful Sentence Embeddings?

IBM found that early cross-encoder layers can produce effective sentence embeddings, enabling 5.15x faster inference while maintaining accuracy comparable to full dual encoders.

πŸ“ arxiv.org/abs/2502.03552

07.02.2025 03:34 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
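The paper's exact recipe is in the link above; purely to illustrate the idea of reading a sentence embedding off an early layer of a cross-encoder checkpoint, here is a hedged toy sketch. The layer index, pooling choice, and model name are my assumptions for illustration, not the paper's setup.

```python
# Toy sketch: mean-pool an early hidden layer of a cross-encoder checkpoint
# to get a single-sentence embedding. The layer choice here is arbitrary.
import torch
from transformers import AutoModel, AutoTokenizer

name = "cross-encoder/ms-marco-MiniLM-L-6-v2"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name, output_hidden_states=True)

def early_layer_embedding(text: str, layer: int = 3) -> torch.Tensor:
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).hidden_states[layer]   # (1, seq_len, dim)
    mask = inputs["attention_mask"].unsqueeze(-1)
    return (hidden * mask).sum(1) / mask.sum(1)          # masked mean pooling

emb = early_layer_embedding("Cross encoders can double as sentence encoders.")
print(emb.shape)
```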

Same issue for me

30.01.2025 19:53 · 👍 0  🔁 0  💬 0  📌 0
Distilling DeepSeek reasoning to ModernBERT classifiers: How can we use the reasoning ability of DeepSeek to generate synthetic labels for fine-tuning a ModernBERT model?

Why choose between strong #LLM reasoning and efficient models?

Use DeepSeek to generate high-quality training data, then distil that knowledge into ModernBERT for fast, efficient classification.

New blog post: danielvanstrien.xyz/posts/2025/d...

29.01.2025 10:07 · 👍 58  🔁 11  💬 2  📌 4
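The linked blog post has the full walkthrough; below is only a hedged sketch of the final step (fine-tuning a ModernBERT classifier on synthetic, teacher-generated labels), with the tiny in-memory dataset, label set, and hyperparameters invented purely for illustration.

```python
# Sketch: fine-tune a ModernBERT classifier on labels produced by a teacher LLM.
# Assumes: pip install "transformers>=4.48" datasets torch
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

# Made-up examples; in practice the labels come from the teacher model's outputs.
data = Dataset.from_dict({
    "text": ["Refund not received after two weeks", "Love the new dashboard!"],
    "label": [0, 1],  # 0 = complaint, 1 = praise
})

name = "answerdotai/ModernBERT-base"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForSequenceClassification.from_pretrained(name, num_labels=2)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="modernbert-distilled", num_train_epochs=1),
    train_dataset=data.map(tokenize, batched=True),
    tokenizer=tokenizer,  # enables dynamic padding via the default data collator
)
trainer.train()
```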

People often claim they know when ChatGPT wrote something, but are they as accurate as they think?

Turns out that while the general population is unreliable, those who frequently use ChatGPT for writing tasks can spot even "humanized" AI-generated text with near-perfect accuracy 🎯

28.01.2025 14:55 · 👍 187  🔁 66  💬 10  📌 19
A Test So Hard No AI System Can Pass It – Yet (Gift Article) The creators of a new test called “Humanity’s Last Exam” argue we may soon lose the ability to create tests hard enough for A.I. models.

I wrote about a new AI evaluation called "Humanity's Last Exam," a collection of 3,000 questions submitted by leading academics to try to stump leading AI models, which mostly find today's college-level tests too easy.

www.nytimes.com/2025/01/23/t...

23.01.2025 16:41 · 👍 209  🔁 46  💬 17  📌 15

I just released Sentence Transformers v3.4.0, featuring a memory leak fix (memory not being cleared upon model & trainer deletion), compatibility between the powerful Cached... losses and the Matryoshka loss modifier, and a bunch of fixes & small features.

Details in 🧵

23.01.2025 16:44 · 👍 12  🔁 4  💬 2  📌 0
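For context on the loss compatibility mentioned in that release, here is a minimal sketch of wrapping a Cached loss in the Matryoshka modifier; the base model name, mini-batch size, and dimension list are placeholders, not values from the release notes.

```python
# Sketch: combine a Cached* loss with the MatryoshkaLoss modifier (compatible as of v3.4.0).
from sentence_transformers import SentenceTransformer
from sentence_transformers.losses import (CachedMultipleNegativesRankingLoss,
                                          MatryoshkaLoss)

model = SentenceTransformer("microsoft/mpnet-base")

# Cached loss allows large effective batch sizes at a fixed memory budget.
inner_loss = CachedMultipleNegativesRankingLoss(model, mini_batch_size=32)

# Matryoshka modifier trains the same embeddings to work at truncated dimensions.
loss = MatryoshkaLoss(model, inner_loss, matryoshka_dims=[768, 512, 256, 128, 64])
# `loss` can then be passed to SentenceTransformerTrainer as usual.
```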
10th Workshop on Representation Learning for NLP - Call for Papers The 10th Workshop on Representation Learning for NLP (RepL4NLP 2025), co-located with NAACL 2025 in Albuquerque, New Mexico, invites papers of a theoretical or experimental nature describing recent ad...

Disappointed with #ICLR or #NAACL reviews? Consider submitting your work to #Repl4NLP, whether it's full papers, extended abstracts, or cross-submissions. 🔥

Details on submissions 👉 sites.google.com/view/repl4nl...

⏰ Deadline January 30

23.01.2025 16:30 · 👍 2  🔁 2  💬 0  📌 1
Introducing Phi-4: Microsoft's Newest Small Language Model Specializing in Complex Reasoning | Microsoft Community Hub Today we are introducing Phi-4, our 14B parameter state-of-the-art small language model (SLM) that excels at complex reasoning in areas such as math, in...

Microsoft's latest small language model, Phi-4, is open source and now available on Hugging Face techcommunity.microsoft.com/blog/aiplatf...

09.01.2025 15:40 · 👍 10  🔁 4  💬 0  📌 0
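A hedged sketch of trying the released checkpoint with transformers; the model id follows the announcement, but the prompt and generation settings are arbitrary, and a 14B model needs substantial GPU memory (or quantization) to run.

```python
# Sketch: load Phi-4 from the Hugging Face Hub and generate a short completion.
# Assumes a recent transformers release and enough memory for a 14B model.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="microsoft/phi-4",
    torch_dtype="auto",   # pick an appropriate dtype automatically
    device_map="auto",    # spread the model across available devices
)

out = generator("Briefly explain what a small language model is.", max_new_tokens=96)
print(out[0]["generated_text"])
```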
LLMs are Also Effective Embedding Models: An In-depth Overview Large language models (LLMs) have revolutionized natural language processing by achieving state-of-the-art performance across various tasks. Recently, their effectiveness as embedding models has gaine...

LLMs are Also Effective Embedding Models: An In-depth Overview

Provides a comprehensive analysis of adopting LLMs as embedding models, examining both zero-shot prompting and tuning strategies for deriving text embeddings competitive with traditional models.

πŸ“ arxiv.org/abs/2412.12591

18.12.2024 06:40 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
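One common zero-shot recipe covered by surveys like this is last-token pooling of a decoder-only LM's hidden states; below is a hedged toy sketch of that idea with a small model standing in for a real LLM. The model choice and lack of any instruction prompt are my simplifications, not the survey's setup.

```python
# Toy sketch: derive a text embedding from a decoder-only LM via last-token pooling.
import torch
from transformers import AutoModel, AutoTokenizer

name = "gpt2"  # stand-in; the surveyed methods use much larger LLMs
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name)

def embed(text: str) -> torch.Tensor:
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, seq_len, dim)
    return hidden[:, -1, :]  # embedding = hidden state of the final token

a, b = embed("A cat sat on the mat."), embed("A kitten rested on the rug.")
print(torch.cosine_similarity(a, b))
```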

You and users of the "but humans" argument assume different goals for AI.

The argument assumes that the goal is to develop human-level AI. (Or it's used to counter statements claiming AI systems are less intelligent than humans.) It's not a direct argument for their usefulness.

01.01.2025 10:23 · 👍 2  🔁 0  💬 0  📌 0

Not sure about this idea (and also the objective of maximizing impact), but I really like the "plain language summary" I've seen in some medical papers.

25.12.2024 10:05 · 👍 1  🔁 0  💬 0  📌 0
How Hallucinatory A.I. Helps Science Dream Up Big Breakthroughs (Gift Article) Hallucinations, a bane of popular A.I. programs, turn out to be a boon for venturesome scientists eager to push back the frontiers of human knowledge.

The upside of A.I. hallucination
gift link www.nytimes.com/2024/12/23/s...

24.12.2024 15:05 · 👍 104  🔁 30  💬 10  📌 4
What happened to BERT & T5? On Transformer Encoders, PrefixLM and Denoising Objectives – Yi Tay A Blogpost series about Model Architectures Part 1: What happened to BERT and T5? Thoughts on Transformer Encoders, PrefixLM and Denoising objectives

Good blog post on good old encoder-style models.

Glad to see ModernBERT recently brought something new to the field. So don't count BERT as GOFAI yet.

www.yitay.net/blog/model-a...

24.12.2024 12:16 · 👍 0  🔁 0  💬 0  📌 0

Announcement #1: our call for papers is up! 🎉
colmweb.org/cfp.html
And excited to announce the COLM 2025 program chairs @yoavartzi.com @eunsol.bsky.social @ranjaykrishna.bsky.social and @adtraghunathan.bsky.social

17.12.2024 15:48 · 👍 67  🔁 24  💬 0  📌 1
Computer Science Conference Deadlines Map Interactive world map of Computer Science, AI, and ML conference deadlines

Unsure where to submit your next research paper now that aideadlin.es is no longer updated? And let's be honest, isn't the location just as important as the conference itself?

πŸ—ΊοΈ Check out my latest side-project: deadlines.pieter.ai

23.12.2024 14:39 β€” πŸ‘ 13    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0

🧪 New pre-print explores generative AI in medicine, highlighting applications for clinicians, patients, researchers, and educators. It also addresses challenges like privacy, transparency, and equity.
Additional details from the author linked below.
🩺🖥️
Direct link: arxiv.org/abs/2412.10337

22.12.2024 15:03 · 👍 20  🔁 4  💬 1  📌 0

As the year draws to an end, instead of listing my publications I want to shine a spotlight on the commonplace assumption that productivity must always increase. Good research is disruptive, and thinking time is central to high-quality scholarship and necessary for disruptive work.

20.12.2024 11:18 · 👍 1156  🔁 376  💬 21  📌 57

IMO, there's a great discussion over there (in my timeline, not the For You tab) with interesting insights from the OpenAI team.

Bluesky isn't there (yet).

22.12.2024 13:24 · 👍 4  🔁 0  💬 0  📌 0

Re your question at the end:

22.12.2024 13:18 · 👍 1  🔁 0  💬 1  📌 0

I'll get straight to the point.

We trained 2 new models. Like BERT, but modern. ModernBERT.

Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.

It's much faster, more accurate, longer context, and more useful. 🧵

19.12.2024 16:45 · 👍 628  🔁 148  💬 19  📌 34
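To make the "workhorse model" claim concrete, here is a hedged sketch of the simplest way to poke at the base checkpoint (masked-token prediction) with transformers; the sentence is arbitrary, and fine-tuning for retrieval or classification works the same way as for older BERT-style encoders.

```python
# Sketch: quick sanity check of ModernBERT with a fill-mask pipeline.
# Assumes a transformers release recent enough to include the ModernBERT architecture.
from transformers import pipeline

fill = pipeline("fill-mask", model="answerdotai/ModernBERT-base")

# Print the top predictions for the masked token with their scores.
for pred in fill("Paris is the [MASK] of France."):
    print(f'{pred["token_str"]:>12}  {pred["score"]:.3f}')
```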

Now also out on arXiv:
www.arxiv.org/abs/2412.12119

18.12.2024 08:23 · 👍 30  🔁 5  💬 2  📌 0

1/ Okay, one thing that has been revealed to me from the replies to this is that many people don't know (or refuse to recognize) the following fact:

The units in ANNs are actually not a terrible approximation of how real neurons work!

A tiny 🧵.

🧠📈 #NeuroAI #MLSky

16.12.2024 20:03 · 👍 153  🔁 39  💬 21  📌 17
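For readers unsure what "unit" means here, a minimal sketch of the standard point-neuron abstraction used in ANNs: a weighted sum of inputs passed through a nonlinearity, loosely analogous to dendritic integration followed by a firing-rate response. The numbers are arbitrary.

```python
# Sketch: a single ANN unit (point neuron): weighted input sum + bias -> nonlinearity.
import math

def unit(inputs, weights, bias):
    # "Dendritic integration": weighted sum of incoming activity
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    # "Firing rate": squashing nonlinearity (sigmoid here; ReLU is more common in practice)
    return 1.0 / (1.0 + math.exp(-total))

print(unit(inputs=[0.2, 0.9, 0.1], weights=[1.5, -0.7, 2.0], bias=0.1))
```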

3-7d is totally fine with me

16.12.2024 09:36 · 👍 4  🔁 0  💬 0  📌 0
Large Concept Models: Language Modeling in a Sentence Representation Space LLMs have revolutionized the field of artificial intelligence and have emerged as the de-facto tool for many tasks. The current established technology of LLMs is to process input and generate output a...

Really interesting paper: The Large Concept Model is trained to perform autoregressive sentence prediction in an embedding space.
arxiv.org/abs/2412.08821

16.12.2024 09:35 · 👍 0  🔁 0  💬 0  📌 0
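This is not the paper's architecture, just a toy sketch of the core idea (autoregressively predicting the next sentence embedding rather than the next token); the dimensions, layer counts, and random "sentence embeddings" are all made up for illustration.

```python
# Toy sketch: predict the next sentence *embedding* from a sequence of sentence embeddings.
import torch
import torch.nn as nn

dim, seq_len = 256, 8                           # made-up embedding size / context length
sentence_embeddings = torch.randn(1, seq_len, dim)  # stand-in for precomputed embeddings

predictor = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True),
    num_layers=2,
)
causal_mask = nn.Transformer.generate_square_subsequent_mask(seq_len)

hidden = predictor(sentence_embeddings, mask=causal_mask)
next_sentence_embedding = hidden[:, -1, :]      # prediction for the following sentence
print(next_sentence_embedding.shape)            # torch.Size([1, 256])
```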
uv: An In-Depth Guide to Python's Fast and Ambitious New Package Manager A comprehensive guide on why and how to start using uv, the package manager (and much more) that's taken the Python world by storm.

I'm not a Python developer, and often battle with environments and dependencies when I have to use it. This comprehensive introduction to the uv package manager makes me less hesitant to use Python! www.saaspegasus.com/guides/uv-de...

11.12.2024 08:41 · 👍 44  🔁 15  💬 3  📌 0

The strain on scientific publishing: we set out to characterise the remarkable growth of the scientific literature in the last few years, in spite of declining growth in total scientists. What is going on?

direct.mit.edu/qss/article/...

A 🧵 1/n
#AcademicSky #PhDchat #ScientificPublishing #SciPub

19.11.2024 12:27 · 👍 992  🔁 559  💬 46  📌 134

The journal Nature asked 6 biomedical scientists to briefly comment on what should be a U.S. priority going forward.
Here's mine.
www.nature.com/articles/d41...

25.11.2024 21:52 · 👍 245  🔁 54  💬 12  📌 9
