
Chengzu

@chengzu-li.bsky.social

PhD student at Language Technology Lab, University of Cambridge

56 Followers  |  100 Following  |  9 Posts  |  Joined: 27.11.2024

Latest posts by chengzu-li.bsky.social on Bluesky

Round of applause for the fantastic collaborators in this project: Wenshan Wu, Huanyu Zhang, Yan Xia, Shaoguang Mao, Li Dong, Ivan Vulić and Furu Wei 🥳🥳

14.01.2025 14:50 · 👍 2    🔁 0    💬 0    📌 0
Link preview: "Imagine while Reasoning in Space: Multimodal Visualization-of-Thought"
Chain-of-Thought (CoT) prompting has proven highly effective for enhancing complex reasoning in Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs). Yet, it struggles in complex ...

📄 Dive Deeper into MVoT

Discover how MVoT rewrites the rules with details like loss design, image tokenization and interleaved multimodal training.
👉 Read our paper on arXiv: arxiv.org/abs/2501.07542

14.01.2025 14:50 · 👍 1    🔁 0    💬 1    📌 0

🔗 MVoT + CoT: New Ceiling for Reasoning

MVoT doesn't replace CoT; it elevates it. Combining MVoT with CoT brings multimodal and verbal reasoning together, pushing the performance upper bound and showing that two reasoning paradigms can be better than one!

14.01.2025 14:50 · 👍 0    🔁 0    💬 1    📌 0
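One concrete, simplified way to picture the MVoT + CoT combination, not necessarily the procedure used in the paper, is to run a text-only CoT pass and an MVoT pass on the same problem and then reconcile their answers. The sketch below assumes the two answers have already been produced elsewhere; combine_answers is an illustrative helper, not part of the paper's code.

    def combine_answers(cot_answer: str, mvot_answer: str) -> str:
        """Illustrative heuristic: reconcile a verbal CoT answer with an MVoT answer."""
        if cot_answer == mvot_answer:
            return cot_answer          # both reasoning paradigms agree
        return mvot_answer             # otherwise prefer the visually grounded MVoT trace

    # usage with answers produced by the two prompting strategies
    print(combine_answers("move down", "move down"))   # -> "move down"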

🎨 Revolutionizing Visual Reasoning with Token Discrepancy Loss

Messy visuals? Not anymore. Our token discrepancy loss ensures that MVoT generates accurate, meaningful visualizations with less redundancy.

Result? Better images, clearer reasoning, stronger performance.

14.01.2025 14:50 · 👍 0    🔁 0    💬 1    📌 0
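To make the intuition behind a token discrepancy loss concrete: the idea is to penalize probability mass that the model places on image-codebook tokens whose embeddings sit far from the ground-truth token's embedding, so near-misses are visually similar rather than arbitrary. The exact formulation is in the paper (arxiv.org/abs/2501.07542); the PyTorch sketch below is only an approximation of that intuition, with illustrative tensor names and a hypothetical codebook from a discrete image tokenizer.

    import torch
    import torch.nn.functional as F

    def token_discrepancy_loss(logits, target_ids, codebook):
        """Sketch of a discrepancy-style loss over generated image tokens.

        logits:     (seq, vocab) scores over the visual codebook at each position
        target_ids: (seq,)       ground-truth image-token ids
        codebook:   (vocab, dim) embedding table of the image tokenizer
        """
        probs = F.softmax(logits, dim=-1)                 # predicted distribution per position
        target_emb = codebook[target_ids]                 # embeddings of the correct tokens
        dist = torch.cdist(target_emb, codebook).pow(2)   # distance to every candidate token
        # probability mass on visually dissimilar tokens is penalized
        return (probs * dist).sum(dim=-1).mean()

    # illustrative shapes: 16 image-token positions, a 512-entry codebook of dim 32
    loss = token_discrepancy_loss(torch.randn(16, 512),
                                  torch.randint(0, 512, (16,)),
                                  torch.randn(512, 32))

In training, a term like this would typically sit alongside the usual cross-entropy over image tokens rather than replace it.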

🎯 Performance Boosts with MVoT

MVoT isn't just new; it's better.
🔥 Better and more stable performance than CoT, particularly in complex scenarios like FrozenLake.
🌟 Plug-and-play power: Supercharges models like GPT-4o for unprecedented versatility.

14.01.2025 14:50 · 👍 0    🔁 0    💬 1    📌 0

🧠 MVoT

MVoT moves beyond Chain-of-Thought (CoT), letting AI imagine what it is thinking through generated images. By blending verbal and visual reasoning, MVoT makes tackling complex problems more intuitive, interpretable, and powerful.

14.01.2025 14:50 · 👍 0    🔁 0    💬 1    📌 0
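To make "imagining while reasoning" concrete, here is a minimal, hypothetical sketch of interleaved decoding: the model alternates between emitting a verbal thought and a visual thought (a generated image fed back as context) until it commits to an answer. The model object and its generate_text / generate_image methods are placeholders for a multimodal model that can emit both word tokens and image tokens; they are not the paper's actual API.

    def mvot_reason(model, question, image, max_steps=8):
        """Hypothetical interleaved decoding loop for visualization-of-thought."""
        context = [image, question]
        for _ in range(max_steps):
            thought = model.generate_text(context)       # verbal reasoning step
            context.append(thought)
            if "final answer" in thought.lower():        # simple stop condition
                return thought
            sketch = model.generate_image(context)       # visual thought: generated image
            context.append(sketch)                       # fed back as multimodal context
        return model.generate_text(context + ["So, the final answer is:"])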

Forget just thinking in words.

🔔 Our New Preprint:
🚀 New Era of Multimodal Reasoning 🚨
🔍 Imagine While Reasoning in Space with MVoT

Multimodal Visualization-of-Thought (MVoT) revolutionizes reasoning by generating visual "thoughts" that transform how AI thinks, reasons, and explains itself.

14.01.2025 14:50 · 👍 6    🔁 1    💬 1    📌 0

Hi, I would love to be added to the list! Thanks!

05.12.2024 15:06 · 👍 1    🔁 0    💬 0    📌 0

🙋 Working on VLMs and would love to be added! Thanks!

05.12.2024 15:02 · 👍 1    🔁 0    💬 1    📌 0
