I played around with the MLX 4-bit version of Qwen3-30B-A3B locally on an Apple M4 Max chip, in Japanese and English. This is amazing. It seems feasible to run it locally for anything other than long-horizon or complex tasks.
30.04.2025 21:42
Oh yeah, I can easily imagine the model outputting that
22.01.2025 18:03
To be clear, the recipe for replicating o1-style models is not new techniques, but existing techniques applied in a new way.
This shouldn't be surprising.
21.01.2025 15:46
I've just played around with DeepSeek-R1 and wow, such long thoughts for a simple question: "What is the square root of 16?"
21.01.2025 19:43
My LinkedIn feed is full of AWS re:Invent posts (since I work at AWS, and many colleagues share about it), Twitter/X is a mixture of everything, and Bluesky posts are mostly academic. Welcome to the filter bubbles!
04.12.2024 05:16
I've been taking a stab at responding to author discussions for ARR over the last few days, and one common issue I see is that submitted drafts considerably exaggerate how good the results are.
29.11.2024 05:17
Look who's here
@pnas.org
@science.org
@naturecellbiology.bsky.social
@natrevgenet.bsky.social
@naturebiotech.bsky.social
@naturemicrobiol.bsky.social
@naturechemistry.bsky.social
@genesdev.bsky.social
@cellchembiol.bsky.social
@genomeresearch.bsky.social
@jcellbiol.bsky.social
25.11.2024 13:48
I like the pdf-to-video feature at typeset.io/pdf-to-video for getting a quick overview of a paper. Looking forward to a more fine-grained version of it (or even customizable, controlled generation of PDF video summaries).
26.11.2024 01:22
Just a heads up to everyone: @deep-mind.bsky.social is unfortunately a fake account and has been reported. Please do not follow it nor repost anything from it.
25.11.2024 23:24
Here's the starter pack for AI/ML/NLP conferences that I was able to find as of now. I couldn't remove myself from the starter pack so feel free to unfollow me after hitting the "follow all" button: go.bsky.app/9QQXJ1u
23.11.2024 01:49
AI Bluesky
Join the conversation
Great AI people starter pack from @chris.bsky.social!
go.bsky.app/KRsy8pF
22.11.2024 11:54
I am sure we have reached only a small fraction of New York's ML community on bsky. Please repost this if you think you may have interested people close to you in the social graph.
22.11.2024 14:14
Someone should really treat me to some coffee for asking me to assign & finish the emergency review within a day
22.11.2024 19:43
1. Find your friends! I've found most of mine with:
- Starter packs blueskydirectory.com/starter-pack...
- the Chrome extension 'Sky Follower Bridge' www.sky-follower-bridge.dev
- @theo.io's Follow Finder, which lists people who are followed by lots of people you follow bsky-follow-finder.theo.io
20.11.2024 19:44
I did a starter pack of people in New York (City) working on ML/AI. Please distribute and feel free to self nominate!
go.bsky.app/BoEtagz
19.11.2024 01:38
@ramon-astudillo.bsky.social self-nominating myself :)
20.11.2024 07:35
Too many people... @Shibuya station, Tokyo, Japan
23.12.2023 08:56
Wow, more than 2000 papers were accepted in total for EMNLP
08.12.2023 02:16
Looking forward to catching up with old friends and meeting new friends :)
References:
[1] arxiv.org/pdf/2305.112...
[2] arxiv.org/pdf/2310.163...
[3] aclanthology.org/2023.conll-1...
06.12.2023 07:56
[3] Bonus. 12/7 1:45pm Though I'm not an author, I'll be helping present a poster at #CoNLL, co-authored by my colleague, titled "Cross-Document Event Coreference Resolution: Instruct Humans or Instruct GPT?"
(my first attempt and let's see how this turns out :) )
06.12.2023 07:52
Heading to #EMNLP! Co-authored papers:
[1] 12/9 11am in-person poster by Sharon Levy titled "Comparing Biases and the Impact of Multilingual Training across Multiple Languages"
[2] 12/8 2pm virtual poster titled "A Multi-Modal Multilingual Benchmark for Document Image Classification"
06.12.2023 07:51
Resolving LaTeX errors for uploading to arXiv...
25.10.2023 04:26
GPT-4V just fixed my circuit breaker (where I had been struggling for 10+ mins at midnight)
24.10.2023 02:20
Finished the EMNLP Findings camera-ready
21.10.2023 02:39
I finally read the attention sink paper [1] and the HF blog article [2]. Seems like another interesting data point that the models we usually interact with strongly attend to the first few tokens...
[1] arxiv.org/abs/2309.17453
[2] huggingface.co/blog/tomaars...
19.10.2023 01:18
COLM 2024
New conference alert! COLM ("collum") seeks a broad range of work on language modeling. 9 pages due Mar 8: colmweb.org
16.10.2023 16:42
Preparing an EMNLP camera-ready version of our accepted paper
14.10.2023 22:38
PhD candidate @Purdue. I work on problems in information theory as well as on ranking and preference learning
I am the co-founder of @cactuscon | ex @bishopfox ex @spiderlabs | currently pursuing a phd at UdeG in ML & OffSec. https://sensecurity.io
slayer of applications | not a super villain
Intern @Google, Ph.D. Student @Cornell_CS.
Interested in machine learning, LLM, brain, and healthcare.
abehrouz.github.io
AI professor at Caltech. General Chair ICLR 2025.
http://www.yisongyue.com
Association for Uncertainty in AI.
Upcoming conference: #uai2025 July 21-25th in Rio de Janeiro, Brazil!
https://auai.org/uai2025
information science professor (tech ethics + internet stuff)
kind of a content creator (elsewhere also @professorcasey)
though not influencing anyone to do anything except maybe learn things
she/her
more: casey.prof
Writing a book on AI+economics+geopolitics for Nation Books.
Covers: The Nation, Jacobin. Bylines: NYT, Nature, Bloomberg, BBC, Guardian, TIME, The Verge, Vox, Thomson Reuters Foundation, + others.
METR is a research nonprofit that builds evaluations to empirically test AI systems for capabilities that could threaten catastrophic harm to society.
Recently a principal scientist at Google DeepMind. Joining Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamical systems.
Anthropic and Import AI. Previously OpenAI, Bloomberg, The Register. Weird futures.
Researcher at Anthropic. Based in SF. Likes cats. smsharma.github.io.
policy for v smart things @openai. Past: PhD @HarvardSEAS/@SchmidtFutures/@MIT_CSAIL. Posts my own; on my head be it
I work on AI at OpenAI.
Former VP AI and Distinguished Scientist at Microsoft.
Founder & executive & community builder & organizer & researcher
ML Collective (mlcollective.org)
Google DeepMind
rosanneliu.com
I train models @ OpenAI.
Previously Research at DeepMind.
Hae sententiae verbaque mihi soli sunt.
CTO, AllenInstitute.org. Advisor, Fathom.org. He/him. Working at intersection of AI and biology.
Research Scientist at the Allen Institute for AI (AI2), interested in information extraction, NLP for healthcare and transfer learning, PhD from CMU LTI. Website: https://www.cs.cmu.edu/~anaik/