
Andreas Chari

@andreaschari.bsky.social

@irglasgow.bsky.social PhD Student, University of Glasgow. Researching multilingual NLP & IR. Supervisors: @macavaney.bsky.social & @iadhounis.bsky.social. 🇨🇾 Views my own

76 Followers  |  265 Following  |  8 Posts  |  Joined: 18.11.2024  |  1.437

Latest posts by andreaschari.bsky.social on Bluesky

Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer
Globalisation and colonisation have led the vast majority of the world to use only a fraction of languages, such as English and French, to communicate, excluding many others. This has severely affecte...

Check out all the details and more findings here:
arxiv.org/abs/2503.22508

I will also present this at the #IR4GOOD track at #ECIR2025 next week, alongside many other contributions from @irglasgow.bsky.social. Looking forward to continuing these discussions at #SIGIR2025.

02.04.2025 08:27 | 👍 1  🔁 0  💬 0  📌 0

Then, we try to zero-shot these fine-tuned models on other language pairs. Some are related to French-Catalan, such as Occitan, and some are entirely unrelated, such as Mandarin.

We see that it does transfer to Occitan-French pairs and to Cantonese-Mandarin pairs (more in the paper).

02.04.2025 08:27 | 👍 1  🔁 0  💬 1  📌 0

We fine-tuned them on Catalan queries and French documents and saw that we can regularise the models to be more robust on Catalan (and we see some gains in French!).

02.04.2025 08:27 | 👍 0  🔁 0  💬 1  📌 0

What will happen if you fine-tune neural rankers such as BGE-M3 and ColBERT-XM on low-resource queries and high-resource documents of two different (albeit related) languages?

Will it help regularize the rankers on these similarities?

02.04.2025 08:27 | 👍 1  🔁 0  💬 1  📌 0
[image] 02.04.2025 08:27 | 👍 0  🔁 0  💬 1  📌 0

We translated five collections of mMARCO into similar languages and evaluated retrieval methods based on how well they would perform if the queries were expressed in a similar low-resource language.

It turns out they do not perform very well (an understatement).

02.04.2025 08:27 | 👍 0  🔁 0  💬 1  📌 0

🚨 New Pre-Print! 🚨 with @macavaney.bsky.social & @iadhounis.bsky.social. Stop using "translate-train" for all your multilingual needs. We explore zero-shot transfer for low-resource languages... 🧵

02.04.2025 08:27 | 👍 3  🔁 1  💬 1  📌 0

🇮🇹🇮🇹🇮🇹
I am happy to share that our work with @macavaney.bsky.social and @iadhounis.bsky.social, "Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer", has been accepted to the #ecir2025 "IR for Good" track.

17.12.2024 14:12 | 👍 10  🔁 0  💬 0  📌 0

Demo paper entitled “FinPersona: An LLM-Driven Conversational Agent for Personalized Financial Advising” has been accepted to #ecir2025 - output of a collaboration with the University of Tokyo with T Takayanagi, M Suzuki, K Izumi, R McCreadie, @javiersanzcruza.bsky.social and @iadhounis.bsky.social

16.12.2024 13:15 | 👍 14  🔁 3  💬 0  📌 0

#IR4Good paper entitled “Fair Exposure Allocation Using Generative Query Expansion” has been accepted to #ecir2025 - work by @tjaenich.bsky.social @grahammcdonald.bsky.social and @iadhounis.bsky.social

16.12.2024 13:55 | 👍 15  🔁 3  💬 0  📌 1

Happy to share that our paper "Improving novelty and diversity of nearest-neighbors recommendation by exploiting dissimilarities" with @psperez.bsky.social and @abellogin.bsky.social has been accepted to #ECIR2025 IR4Good track!

16.12.2024 14:16 | 👍 19  🔁 2  💬 0  📌 0

Full length paper entitled “One size doesn’t fit all: Predicting the Number of Examples for In-Context Learning” has been accepted at #ecir2025 - work by Manish Chandra, @gdebasis.bsky.social and @iadhounis.bsky.social

16.12.2024 19:58 | 👍 11  🔁 2  💬 0  📌 0

Full length paper entitled “A Multi-modal Recipe for Improved Multi-domain Recommendation” has been accepted at #ecir2025 - work by @zixuanyi.bsky.social and @iadhounis.bsky.social

16.12.2024 19:59 | 👍 20  🔁 3  💬 0  📌 0

#IR4Good paper entitled “Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer” has been accepted to #ecir2025 - work by @andreaschari.bsky.social @macavaney.bsky.social and @iadhounis.bsky.social

16.12.2024 13:52 | 👍 13  🔁 2  💬 0  📌 0
Detecting hallucinations in large language models using semantic entropy - Nature
Hallucinations (confabulations) in large language model systems can be tackled by measuring uncertainty about the meanings of generated responses rather than the text itself to improve question-a...

Tomorrow, in our weekly reading group series, we will be discussing the recent Nature paper entitled “Detecting hallucinations in large language models using semantic entropy” by Farquhar et al. #IRGlasgowReadingGroup
www.nature.com/articles/s41...

21.11.2024 13:58 | 👍 14  🔁 1  💬 0  📌 0

Today, Aixin Sun from Nanyang Technological University is giving an #IRTalk entitled "Understanding and Evaluating Recommender Systems from a User Perspective". Details: samoa.dcs.gla.ac.uk/events/viewt...

@uofgcompsci.bsky.social
@irglasgow.bsky.social

18.11.2024 17:26 | 👍 10  🔁 3  💬 0  📌 0

At 15:00 on 25th November, Jianling Wang from Google DeepMind will give an #IRTalk entitled "When LLMs Meet Recommendations: Scalable Hybrid Approaches to Enhance User Experiences". Details: samoa.dcs.gla.ac.uk/events/viewt...

@uofgcompsci.bsky.social
@irglasgow.bsky.social

18.11.2024 16:47 | 👍 14  🔁 6  💬 0  📌 0

Hello everyone! Happy to be here! If you are interested in the research that we do at @irglasgow.bsky.social (IR, RecSys, NLP), I have created a starter pack for you!

Access it here: go.bsky.app/BM6iHbU

I will keep updating it over time as more people in our team join Bluesky!

18.11.2024 14:03 | 👍 13  🔁 6  💬 0  📌 1
