Then, we try to zero-shot these fine-tuned models on other language pairs. Some are related to French-Catalan, such as Occitan, and some are entirely unrelated, such as Mandarin.
We see that it does transfer to Occitan-French and Cantonese-Mandarin pairs (more in the paper).
02.04.2025 08:27
We fine-tuned them on Catalan queries and French documents and see that we can regularise the models to be more robust on Catalan (and we see some gains in French!)
02.04.2025 08:27
What will happen if you fine-tune neural rankers such as BGE-M3 and ColBERT-XM on low-resource queries and high-resource documents of two different (albeit related) languages?
Will it help regularize the rankers on these similarities?
02.04.2025 08:27
We translated five collections of mMARCO into similar languages and evaluated retrieval methods based on how well they would perform if the queries were expressed in a similar low-resource language.
It turns out they do not perform very well (an understatement).
02.04.2025 08:27
🚨 New Pre-Print! 🚨 with @macavaney.bsky.social &
@iadhounis.bsky.social. Stop using "translate-train" for all your multilingual needs. We explore zero-shot transfer for low-resource languages... ๐งต
02.04.2025 08:27
🇮🇹🇮🇹🇮🇹
I am happy to share that our work with @macavaney.bsky.social and @iadhounis.bsky.social, "Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer", has been accepted to the #ecir2025 "IR for Good" track.
17.12.2024 14:12
Demo paper entitled "FinPersona: An LLM-Driven Conversational Agent for Personalized Financial Advising" has been accepted to #ecir2025 - output of a collaboration with the University of Tokyo with T Takayanagi, M Suzuki, K Izumi, R McCreadie, @javiersanzcruza.bsky.social and @iadhounis.bsky.social
16.12.2024 13:15
#IR4Good paper entitled "Fair Exposure Allocation Using Generative Query Expansion" has been accepted to #ecir2025 - work by @tjaenich.bsky.social @grahammcdonald.bsky.social and @iadhounis.bsky.social
16.12.2024 13:55
Happy to share that our paper "Improving novelty and diversity of nearest-neighbors recommendation by exploiting dissimilarities" with @psperez.bsky.social and @abellogin.bsky.social has been accepted to #ECIR2025 IR4Good track!
16.12.2024 14:16
Full length paper entitled "One size doesn't fit all: Predicting the Number of Examples for In-Context Learning" has been accepted at #ecir2025 - work by Manish Chandra, @gdebasis.bsky.social and @iadhounis.bsky.social
16.12.2024 19:58
Full length paper entitled "A Multi-modal Recipe for Improved Multi-domain Recommendation" has been accepted at #ecir2025 - work by @zixuanyi.bsky.social and @iadhounis.bsky.social
16.12.2024 19:59
#IR4Good paper entitled "Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer" has been accepted to #ecir2025 - work by @andreaschari.bsky.social @macavaney.bsky.social and @iadhounis.bsky.social
16.12.2024 13:52
At 15:00 on 25th November, Jianling Wang from Google DeepMind will give an #IRTalk entitled "When LLMs Meet Recommendations: Scalable Hybrid Approaches to Enhance User Experiences". Details: samoa.dcs.gla.ac.uk/events/viewt...
@uofgcompsci.bsky.social
@irglasgow.bsky.social
18.11.2024 16:47
Hello everyone! Happy to be here! If you are interested in the research that we do at @irglasgow.bsky.social (IR, RecSys, NLP), I have created a starter pack for you!
Access it here: go.bsky.app/BM6iHbU
I will keep updating it over time as more people in our team join BlueSky!
18.11.2024 14:03
Senior MLE at Meta. Trying to keep up with the Information Retrieval domain!
Blog: https://blog.reachsumit.com/
Newsletter: https://recsys.substack.com/
Lecturer at @UofGlasgow. Affiliated Lecturer at @CambridgeLTL. Working on IR, NLP, KGs, AI Agents, http://AI4BioMed.org and https://github.com/EvoAgentX/EvoAgentX
A researcher in Information Retrieval and NLP... Lecturer at the University of Glasgow.
Postdoctoral Researcher at AMCO/DEI, University of Padua; PhD from University of Glasgow. Fairness, Quantification, Transfer Learning. pugantsov.com
Lecturer at @uofgcompsci.bsky.social using machine learning to gather biomedical knowledge.
navigating tech and its unholy masters
💼 r&d at pex.com
👨🏻 dad at home
AI @ OpenAI, Tesla, Stanford
building chatgpt at openai
prev quizlet, twitter, mit
ankushg.com
data science @ openai
here for data, transit, urbanism, nyc
Research Scientist at Google DeepMind. ಕನ್ನಡಿಗ.
Past: Researchoor, Algorithms team at OpenAI & with Juergen Schmidhuber.
Reverse engineering neural networks at Anthropic. Previously Distill, OpenAI, Google Brain. Personal account.
Engineering at OpenAI. Formerly working on Fuchsia at Google
ai research @ thinking machines . realtime video+voice. i like trains and bikes. sometimes I climb rocks and throw pottery.
Chief Scientist at the UK AI Security Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc.
policy for v smart things @openai. Past: PhD @HarvardSEAS/@SchmidtFutures/@MIT_CSAIL. Posts my own; on my head be it
Forever expanding my nerd/bimbo Pareto frontier. Ex-OpenAI, AGI safety and governance, fellow @rootsofprogress.
Voice/Multimodal at OpenAI. Started the ChatGPT Android team.