Andreas Chari @andreaschari

Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer
Globalisation and colonisation have led the vast majority of the world to use only a fraction of languages, such as English and French, to communicate, excluding many others. This has severely affecte...

Check out all the details and more findings here:
arxiv.org/abs/2503.22508

I will also present this in the #IR4GOOD track at #ECIR2025 next week, alongside many other contributions from @irglasgow.bsky.social. Looking forward to continuing these discussions at #SIGIR2025.

02.04.2025 08:27 — 👍 1 🔁 0 💬 0 📌 0

Then, we apply these fine-tuned models zero-shot to other language pairs. Some involve languages related to French and Catalan, such as Occitan, and some are entirely unrelated, such as Mandarin.

We see that it does transfer to Occitan-French pairs and to Cantonese-Mandarin pairs (more in the paper).

02.04.2025 08:27 — 👍 1 🔁 0 💬 1 📌 0

We fine-tuned them on Catalan queries and French documents and saw that this regularises the models to be more robust on Catalan (and we see some gains in French!)

02.04.2025 08:27 — 👍 0 🔁 0 💬 1 📌 0

What will happen if you fine-tune neural rankers such as BGE-M3 and ColBERT-XM on low-resource queries and high-resource documents of two different (albeit related) languages?

Will it help regularize the rankers on these similarities?

02.04.2025 08:27 — 👍 1 🔁 0 💬 1 📌 0

We translated five collections of mMARCO into similar languages and evaluated retrieval methods based on how well they would perform if the queries were expressed in a similar low-resource language.

It turns out they do not perform very well (an understatement).

02.04.2025 08:27 — 👍 0 🔁 0 💬 1 📌 0

🚨 New Pre-Print! 🚨 with @macavaney.bsky.social & @iadhounis.bsky.social. Stop using "translate-train" for all your multilingual needs. We explore zero-shot transfer for low-resource languages... 🧵

02.04.2025 08:27 — 👍 3 🔁 1 💬 1 📌 0

🇮🇹🇮🇹🇮🇹
I am happy to share that our work with @macavaney.bsky.social and @iadhounis.bsky.social, "Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer", has been accepted to the #ecir2025 "IR for Good" track.

17.12.2024 14:12 — 👍 10 🔁 0 💬 0 📌 0

Demo paper entitled “FinPersona: An LLM-Driven Conversational Agent for Personalized Financial Advising” has been accepted to #ecir2025 - output of a collaboration with the University of Tokyo, with T Takayanagi, M Suzuki, K Izumi, R McCreadie, @javiersanzcruza.bsky.social and @iadhounis.bsky.social

16.12.2024 13:15 — 👍 14 🔁 3 💬 0 📌 0

#IR4Good paper entitled “Fair Exposure Allocation Using Generative Query Expansion” has been accepted to #ecir2025 - work by @tjaenich.bsky.social @grahammcdonald.bsky.social and @iadhounis.bsky.social

16.12.2024 13:55 — 👍 15 🔁 3 💬 0 📌 1

Happy to share that our paper "Improving novelty and diversity of nearest-neighbors recommendation by exploiting dissimilarities" with @psperez.bsky.social and @abellogin.bsky.social has been accepted to #ECIR2025 IR4Good track!

16.12.2024 14:16 — 👍 19 🔁 2 💬 0 📌 0

Full length paper entitled “One size doesn’t fit all: Predicting the Number of Examples for In-Context Learning” has been accepted at #ecir2025 - work by Manish Chandra, @gdebasis.bsky.social and @iadhounis.bsky.social

16.12.2024 19:58 — 👍 11 🔁 2 💬 0 📌 0

Full length paper entitled “A Multi-modal Recipe for Improved Multi-domain Recommendation” has been accepted at #ecir2025 - work by @zixuanyi.bsky.social and @iadhounis.bsky.social

16.12.2024 19:59 — 👍 20 🔁 3 💬 0 📌 0

#IR4Good paper entitled “Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer” has been accepted to #ecir2025 - work by @andreaschari.bsky.social @macavaney.bsky.social and @iadhounis.bsky.social

16.12.2024 13:52 — 👍 13 🔁 2 💬 0 📌 0

Detecting hallucinations in large language models using semantic entropy - Nature
Hallucinations (confabulations) in large language model systems can be tackled by measuring uncertainty about the meanings of generated responses rather than the text itself to improve question-a...

Tomorrow, in our weekly reading group series, we will be discussing the recent Nature paper entitled “Detecting hallucinations in large language models using semantic entropy” by Farquhar et al. #IRGlasgowReadingGroup
www.nature.com/articles/s41...

21.11.2024 13:58 — 👍 14 🔁 1 💬 0 📌 0

Today, Aixin Sun from Nanyang Technological University is giving an #IRTalk entitled "Understanding and Evaluating Recommender Systems from a User Perspective". Details: samoa.dcs.gla.ac.uk/events/viewt...

@uofgcompsci.bsky.social
@irglasgow.bsky.social

18.11.2024 17:26 — 👍 10 🔁 3 💬 0 📌 0

At 15:00 on 25th November, Jianling Wang from Google DeepMind will give an #IRTalk entitled "When LLMs Meet Recommendations: Scalable Hybrid Approaches to Enhance User Experiences". Details: samoa.dcs.gla.ac.uk/events/viewt...

@uofgcompsci.bsky.social
@irglasgow.bsky.social

18.11.2024 16:47 — 👍 14 🔁 6 💬 0 📌 0

Hello everyone! Happy to be here! If you are interested in the research that we do at @irglasgow.bsky.social (IR, RecSys, NLP), I have created a starter pack for you!

Access it here: go.bsky.app/BM6iHbU

I will keep updating it over time as more people in our team join Bluesky!

18.11.2024 14:03 — 👍 13 🔁 6 💬 0 📌 1

Latest posts by andreaschari.bsky.social on Bluesky
