π We're building a new type of word processor at Marker, and we're hiring for React/ProseMirror engineers and full-stack AI engineers to join the team in London.
Are you an engineer who cares about writing? Or do you know someone who does?
See: writewithmarker.com/jobs
More details below π
04.06.2025 17:11 β π 24 π 11 π¬ 2 π 1
Sorted, thanks!
02.04.2025 22:07 β π 0 π 0 π¬ 0 π 0
Youβll collaborate with a kind, curious, research-driven teamβincluding the brilliant @joao.omg.lol & @martinklissarov.bsky.social βand get to shape work at the frontier of multi-agent learning.
If that sounds like you, apply!
DM me if you're curious or have questions
02.04.2025 09:57 β π 5 π 0 π¬ 1 π 0
Some big questions weβre thinking about:
1β£How do communication protocols emerge?
2β£What inductive biases help coordination?
3β£How can language improve generalisation and transfer?
02.04.2025 09:57 β π 6 π 0 π¬ 1 π 0
Weβre interested in:
π€π€ Multi-agent RL
π Emergent language
π² Communication games
π§ Social & cognitive modelling
π Scaling laws for coordination
02.04.2025 09:57 β π 4 π 0 π¬ 1 π 0
The project explores how agents can learn to communicate and coordinate in complex, open-ended environmentsβthrough emergent protocols, not hand-coded rules.
02.04.2025 09:57 β π 3 π 0 π¬ 1 π 0
π¨ Iβm hosting a Student Researcher @GoogleDeepMind!
Join us on the Autonomous Assistants team (led by
@egrefen.bsky.social) to explore multi-agent communicationβhow agents learn to interact, coordinate, and solve tasks together.
DM me for details!
02.04.2025 09:57 β π 14 π 3 π¬ 1 π 0
Our full paper:
arxiv.org/pdf/2503.19711
02.04.2025 09:51 β π 3 π 0 π¬ 0 π 0
Our work highlights the need for LLMs to improve in areas like action selection, self-evaluation + goal alignment to perform robustly in open-ended tasks
Implications of this work extend beyond writing assistance to autonomous workflows for LLMs in general open-ended use cases
02.04.2025 09:51 β π 2 π 0 π¬ 1 π 0
Finding: LLMs can lose track of the original goal during iterative refinement, leading to "semantic drift" - a divergence from the author's intent. This is a key challenge for autonomous revision. βοΈ
02.04.2025 09:51 β π 4 π 0 π¬ 1 π 0
Finding: LLMs struggle to reliably filter their own suggestions. They need better self-evaluation to work effectively in autonomous revision workflows. βοΈ
02.04.2025 09:51 β π 3 π 0 π¬ 1 π 0
Finding: Gemini 1.5 Pro produced the highest quality editing suggestions, according to human evaluators, outperforming Claude 3.5 Sonnet and GPT-4o π¦Ύ
02.04.2025 09:51 β π 4 π 0 π¬ 1 π 0
Finding: LLMs tend to favour adding content, whereas human editors remove or restructure more. This suggests LLMs are sycophantic, reinforcing existing text rather than critically evaluating it. β
02.04.2025 09:51 β π 4 π 0 π¬ 1 π 0
Why? There are many possible solutions and no single 'right' answer. Success is difficult to gauge!
We examine how LLMs generate + select text revisions, comparing their actions to human editors. We focus on action diversity, alignment with human prefs, and iterative improvement
02.04.2025 09:51 β π 2 π 0 π¬ 1 π 0
Our paper explores this by analysing LLMs as autonomous co-writers. Work done with Lucia Lopez Rivilla, @egrefen.bsky.social ) π«Ά
Open-ended tasks like writing are a real challenge for LLMs (even powerful ones like Gemini 1.5 Pro, Claude 3.5 Sonnet, and GPT-4o).
02.04.2025 09:51 β π 4 π 0 π¬ 1 π 0
New paper from our team @GoogleDeepMind!
π¨ We've put LLMs to the test as writing co-pilots β how good are they really at helping us write? LLMs are increasingly used for open-ended tasks like writing assistance, but how do we assess their effectiveness? π€
arxiv.org/pdf/2503.19711
02.04.2025 09:51 β π 20 π 8 π¬ 1 π 1
you're telling me a cherry picked this example?
01.01.2025 14:27 β π 158 π 15 π¬ 2 π 0
Instead of listing my publications, as the year draws to an end, I want to shine the spotlight on the commonplace assumption that productivity must always increase. Good research is disruptive and thinking time is central to high quality scholarship and necessary for disruptive research.
20.12.2024 11:18 β π 1154 π 375 π¬ 21 π 57
postdoc @ oxford robotics institute. interested in reinforcement learning, graphs, robots, and combinatorial optimization.
https://victor.darvariu.me
Developing tools, algorithms and workflows for geometric rationalization, material-aware design, and digital fabrication. Also curious about embodied cognition, meaning-making, and art-science fusions.
Assistant professor in NLP @UniMelb
Multidisciplinary network of language scientists at the University of Cambridge
PhD student @ ETH ZΓΌrich | all aspects of NLP but mostly evaluation and MT | go vegan | https://vilda.net
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
ML & NLP at Google DeepMind
Research Engineer at Google DeepMind
A special snowflake existing in 196883 dimensions
#ActuallyAutistic He/Him
MTS @ Cohere on code. Views not my employerβs.
Waiting on a robot body. All opinions are universal and held by both employers and family.
Literally a professor. Recruiting students to start my lab.
ML/NLP/they/she.
Assistant Prof. at the Technion.
Computational Psycholinguistics, NLP, Cognitive Science.
https://lacclab.github.io/
I work on speech and language technologies at Google. I like languages, history, maps, traveling, cycling, and buying way too many books.
linguist turned NLP researcher, PhD student @cambridgenlp
Gemini Post-Training β«οΈ Research Scientist at Google DeepMind β«οΈ PhD from ETH Zurich
Getting paid to complain about LLM Evaluation at Cohere. #NLP #NLProc
https://dennis-aumiller.de
https://julienposture.com/
https://julienposture.substack.com/
Illustrator & PhD candidate in anthropology, University of Cambridge
I study how people and machines look together and against each other. (he/him)
Visual culture|semiotics|STS
Design engineer playing with AI and hacky prototypes @githubnext.com
Adores digital gardening, end-user development, and embodied cognition. Makes visual essays about design, programming, and anthropology.
π London
π± maggieappleton.com
website: https://t.co/ml5yPJjZLO Natural Language Processing and Machine Learning researcher at the University of Cambridge. Member of the PaNLP group: https://www.panlp.org/ and fellow of Fitzwilliam College.
ML @ Biographica, working on sustainable agriculture | Finalising #NLProc PhD at Cambridge