Yoshinari Fujinuma @akkikiki

I played around with the mlx and 4-bit version of Qwen3-30B-A3B locally on an Apple M4 Max chip, with Japanese and English. This is amazing. It seems feasible to run it locally for tasks other than long-horizon or complex ones.

30.04.2025 21:42 — 👍 1 🔁 0 💬 0 📌 0

Oh yeah, I can easily imagine that'll be outputted by the model 🙂

22.01.2025 18:03 — 👍 0 🔁 0 💬 0 📌 0

To be clear, the recipe to replicate o1 style models is not new techniques, but applying them in a new way.
This shouldn't be surprising.

21.01.2025 15:46 — 👍 34 🔁 5 💬 1 📌 1

I've just played around with DeepSeek-R1 and wow, such a long thoughts for a simple question "What is the square root of 16?" 😀

21.01.2025 19:43 — 👍 1 🔁 0 💬 1 📌 0

My LinkedIn feed is full of AWS re:Invent posts (since I work at AWS, and many colleagues share about it), Twitter/X is a mixture of everything, and Bluesky posts are mostly academic. Welcome to the filter bubbles!

04.12.2024 05:16 — 👍 2 🔁 0 💬 0 📌 0

I was taking a stab at the responding to author discussions for ARR for the last few days, but some common issues I see is that submitted drafts are pretty exaggerating how good the results are.

29.11.2024 05:17 — 👍 0 🔁 0 💬 0 📌 0

Now Hear This: World’s Most Flexible Sound Machine Debuts Fugatto generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.

Where is the open-weight model that we can try it out? 😀
blogs.nvidia.com/blog/fugatto...

26.11.2024 03:28 — 👍 0 🔁 0 💬 0 📌 0

Look who's here
@pnas.org ✔️
@science.org ✔️
@naturecellbiology.bsky.social ✔️
@natrevgenet.bsky.social ✔️
@naturebiotech.bsky.social ✔️
@naturemicrobiol.bsky.social ✔️
@naturechemistry.bsky.social ✔️
@genesdev.bsky.social ✔️
@cellchembiol.bsky.social ✔️
@genomeresearch.bsky.social ✔️
@jcellbiol.bsky.social ✔️

25.11.2024 13:48 — 👍 504 🔁 237 💬 34 📌 20

I like typeset.io/pdf-to-video 's pdf-to-video feature for getting a quick overview of the paper. Looking forward to having more fine-grained video (or even customizable controlled generation of pdf video summary) version of it 😀

26.11.2024 01:22 — 👍 1 🔁 0 💬 0 📌 0

Just a heads up to everyone: @deep-mind.bsky.social is unfortunately a fake account and has been reported. Please do not follow it nor repost anything from it.

25.11.2024 23:24 — 👍 82 🔁 34 💬 9 📌 3

Here's the starter pack for AI/ML/NLP conferences that I was able to find as of now. I couldn't remove myself from the starter pack so feel free to unfollow me after hitting the "follow all" button 🙂 go.bsky.app/9QQXJ1u

23.11.2024 01:49 — 👍 1 🔁 0 💬 0 📌 0

AI Bluesky Join the conversation

Great AI people starter pack from @chris.bsky.social!

go.bsky.app/KRsy8pF

22.11.2024 11:54 — 👍 73 🔁 12 💬 7 📌 2

📣 I am sure we have reached only a small fraction of New York's ML community in bsky. Please repost 🔁 this if you think you may have interested people close to you in the social graph.

22.11.2024 14:14 — 👍 20 🔁 7 💬 2 📌 1

Someone should really treat me some coffee for asking me to assign & finish the emergency review within a day 😀

22.11.2024 19:43 — 👍 1 🔁 0 💬 0 📌 0

1. Find your friends! I've found most of mine with:

- Starter packs blueskydirectory.com/starter-pack...
- the Chrome extension 'Sky Follower Bridge' www.sky-follower-bridge.dev
- @theo.io's Follow Finder, which lists people who are followed by lots of people you follow bsky-follow-finder.theo.io

20.11.2024 19:44 — 👍 223 🔁 29 💬 12 📌 5

Does your LLM truly unlearn? An embarrassingly simple approach to recover unlearned knowledge Large language models (LLMs) have shown remarkable proficiency in generating text, benefiting from extensive training on vast textual corpora. However, LLMs may also acquire unwanted behaviors from th...

Today's paper reading: Interesting, quantization can reverse unlearning up to 83% arxiv.org/abs/2410.16454

21.11.2024 02:43 — 👍 2 🔁 0 💬 0 📌 0

I did a starter pack of people in New York (City) working on ML/AI. Please distribute and feel free to self nominate!

go.bsky.app/BoEtagz

19.11.2024 01:38 — 👍 86 🔁 19 💬 42 📌 6

@ramon-astudillo.bsky.social self-nominating myself :)

20.11.2024 07:35 — 👍 0 🔁 0 💬 1 📌 0

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Knowledge distillation (KD) is widely used for compressing a teacher model to reduce its inference cost and memory footprint, by training a smaller student model. However, current KD methods for auto-...

Today's paper reading: arxiv.org/abs/2306.13649

20.11.2024 05:37 — 👍 0 🔁 0 💬 0 📌 0

Too many people... @Shibuya station, Tokyo, Japan

23.12.2023 08:56 — 👍 0 🔁 0 💬 0 📌 0

Wow, more than 2000 papers were accepted in total for EMNLP

08.12.2023 02:16 — 👍 0 🔁 0 💬 0 📌 0

Looking forward to catching up with old friends and meeting new friends :)

References:
[1] arxiv.org/pdf/2305.112...
[2] arxiv.org/pdf/2310.163...
[3] aclanthology.org/2023.conll-1...

06.12.2023 07:56 — 👍 0 🔁 0 💬 0 📌 0

[3] Bonus. 12/7 1:45pm Though I'm not the author, I'll be helping out presenting the poster at #CoNLL co-authored by my colleague titled "Cross-Document Event Coreference Resolution: Instruct Humans or Instruct GPT?"
(my first attempt and let's see how this turns out :) )

06.12.2023 07:52 — 👍 0 🔁 0 💬 1 📌 0

Heading to #EMNLP ! Co-authored papers 👇
[1] 12/9 11am in-person poster by Sharon Levy title "Comparing Biases and the Impact of Multilingual Training across Multiple Languages"

[2] 12/8 2pm virtual poster titled "A Multi-Modal Multilingual Benchmark for Document Image Classification"

06.12.2023 07:51 — 👍 2 🔁 0 💬 1 📌 0

Resolving Latex errors for uploading to arxiv...

25.10.2023 04:26 — 👍 0 🔁 0 💬 0 📌 0

GPT-4V just fixed my circuit breaker (where I had been struggling for 10+ mins at midnight)

24.10.2023 02:20 — 👍 1 🔁 0 💬 0 📌 0

Done finishing up the EMNLP findings camera ready

21.10.2023 02:39 — 👍 1 🔁 0 💬 0 📌 0

I finally read the attention sink paper [1] and the HF blog article [2]. Seems like another interesting data point that the models we usually interact with strongly attend to the first few tokens...
[1] arxiv.org/abs/2309.17453
[2] huggingface.co/blog/tomaars...

19.10.2023 01:18 — 👍 0 🔁 0 💬 0 📌 0

COLM 2024

New conference alert! COLM (“collum”) seeks a broad range of work on language modeling. 9 pages due Mar 8: colmweb.org

16.10.2023 16:42 — 👍 11 🔁 6 💬 0 📌 0

Preparing an EMNLP camera ready version of our accepted paper ✍️✍️✍️

14.10.2023 22:38 — 👍 0 🔁 0 💬 0 📌 0

Yoshinari Fujinuma

Latest posts by akkikiki.bsky.social on Bluesky

@akkikiki is following 20 prominent accounts