Yoshinari Fujinuma

@akkikiki.bsky.social

Member of Technical Staff @ Cantina Labs (NYC); CS PhD @CUBoulder; Posts are my own; Website: https://akkikiki.github.io/ Substack: http://substack.com/@akkikiki Lived: 🇹🇭🇯🇵🇫🇷🇺🇸 Posts: JA/EN

278 Followers  |  423 Following  |  22 Posts  |  Joined: 05.10.2023

Latest posts by akkikiki.bsky.social on Bluesky

I played around with the mlx and 4-bit version of Qwen3-30B-A3B locally on an Apple M4 Max chip, with Japanese and English. This is amazing. It seems feasible to run it locally for tasks other than long-horizon or complex ones.

30.04.2025 21:42 | 👍 1    🔁 0    💬 0    📌 0

Oh yeah, I can easily imagine that'll be outputted by the model 🙂

22.01.2025 18:03 | 👍 0    🔁 0    💬 0    📌 0

To be clear, the recipe to replicate o1 style models is not new techniques, but applying them in a new way.
This shouldn't be surprising.

21.01.2025 15:46 | 👍 34    🔁 5    💬 1    📌 1

I've just played around with DeepSeek-R1 and wow, such long thoughts for a simple question: "What is the square root of 16?" 😀

21.01.2025 19:43 | 👍 1    🔁 0    💬 1    📌 0

My LinkedIn feed is full of AWS re:Invent posts (since I work at AWS, and many colleagues share about it), Twitter/X is a mixture of everything, and Bluesky posts are mostly academic. Welcome to the filter bubbles!

04.12.2024 05:16 | 👍 2    🔁 0    💬 0    📌 0

I've been taking a stab at responding to author discussions for ARR for the last few days, and a common issue I see is that submitted drafts considerably exaggerate how good the results are.

29.11.2024 05:17 | 👍 0    🔁 0    💬 0    📌 0
Now Hear This: World’s Most Flexible Sound Machine Debuts Fugatto generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.

Where is the open-weight model that we can try out? 😀
blogs.nvidia.com/blog/fugatto...

26.11.2024 03:28 | 👍 0    🔁 0    💬 0    📌 0

Look who's here
@pnas.org ✔️
@science.org ✔️
@naturecellbiology.bsky.social ✔️
@natrevgenet.bsky.social ✔️
@naturebiotech.bsky.social ✔️
@naturemicrobiol.bsky.social ✔️
@naturechemistry.bsky.social ✔️
@genesdev.bsky.social ✔️
@cellchembiol.bsky.social ✔️
@genomeresearch.bsky.social ✔️
@jcellbiol.bsky.social ✔️

25.11.2024 13:48 | 👍 504    🔁 237    💬 34    📌 20

I like the pdf-to-video feature at typeset.io/pdf-to-video for getting a quick overview of a paper. Looking forward to a more fine-grained video version of it (or even customizable, controlled generation of PDF video summaries) 😀

26.11.2024 01:22 | 👍 1    🔁 0    💬 0    📌 0

Just a heads up to everyone: @deep-mind.bsky.social is unfortunately a fake account and has been reported. Please do not follow it nor repost anything from it.

25.11.2024 23:24 | 👍 82    🔁 34    💬 9    📌 3

Here's the starter pack for AI/ML/NLP conferences that I was able to find as of now. I couldn't remove myself from the starter pack, so feel free to unfollow me after hitting the "follow all" button 🙂 go.bsky.app/9QQXJ1u

23.11.2024 01:49 | 👍 1    🔁 0    💬 0    📌 0
AI Bluesky Join the conversation

Great AI people starter pack from @chris.bsky.social!

go.bsky.app/KRsy8pF

22.11.2024 11:54 | 👍 73    🔁 12    💬 7    📌 2

📣 I am sure we have reached only a small fraction of New York's ML community in bsky. Please repost 🔁 this if you think you may have interested people close to you in the social graph.

22.11.2024 14:14 | 👍 20    🔁 7    💬 2    📌 1

Someone should really treat me to some coffee for asking me to assign & finish the emergency review within a day 😀

22.11.2024 19:43 | 👍 1    🔁 0    💬 0    📌 0

1. Find your friends! I've found most of mine with:

- Starter packs blueskydirectory.com/starter-pack...
- the Chrome extension 'Sky Follower Bridge' www.sky-follower-bridge.dev
- @theo.io's Follow Finder, which lists people who are followed by lots of people you follow bsky-follow-finder.theo.io

20.11.2024 19:44 | 👍 223    🔁 29    💬 12    📌 5
Does your LLM truly unlearn? An embarrassingly simple approach to recover unlearned knowledge Large language models (LLMs) have shown remarkable proficiency in generating text, benefiting from extensive training on vast textual corpora. However, LLMs may also acquire unwanted behaviors from th...

Today's paper reading: Interesting, quantization can reverse unlearning up to 83% arxiv.org/abs/2410.16454

21.11.2024 02:43 | 👍 2    🔁 0    💬 0    📌 0

I did a starter pack of people in New York (City) working on ML/AI. Please distribute and feel free to self nominate!

go.bsky.app/BoEtagz

19.11.2024 01:38 | 👍 86    🔁 19    💬 42    📌 6

@ramon-astudillo.bsky.social self-nominating myself :)

20.11.2024 07:35 | 👍 0    🔁 0    💬 1    📌 0
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Knowledge distillation (KD) is widely used for compressing a teacher model to reduce its inference cost and memory footprint, by training a smaller student model. However, current KD methods for auto-...

Today's paper reading: arxiv.org/abs/2306.13649

20.11.2024 05:37 | 👍 0    🔁 0    💬 0    📌 0

Too many people... @Shibuya station, Tokyo, Japan

23.12.2023 08:56 | 👍 0    🔁 0    💬 0    📌 0

Wow, more than 2000 papers were accepted in total for EMNLP

08.12.2023 02:16 | 👍 0    🔁 0    💬 0    📌 0

Looking forward to catching up with old friends and meeting new friends :)

References:
[1] arxiv.org/pdf/2305.112...
[2] arxiv.org/pdf/2310.163...
[3] aclanthology.org/2023.conll-1...

06.12.2023 07:56 | 👍 0    🔁 0    💬 0    📌 0

[3] Bonus. 12/7 1:45pm Though I'm not the author, I'll be helping out presenting the poster at #CoNLL co-authored by my colleague titled "Cross-Document Event Coreference Resolution: Instruct Humans or Instruct GPT?"
(my first attempt and let's see how this turns out :) )

06.12.2023 07:52 | 👍 0    🔁 0    💬 1    📌 0

Heading to #EMNLP! Co-authored papers 👇
[1] 12/9 11am in-person poster by Sharon Levy titled "Comparing Biases and the Impact of Multilingual Training across Multiple Languages"

[2] 12/8 2pm virtual poster titled "A Multi-Modal Multilingual Benchmark for Document Image Classification"

06.12.2023 07:51 | 👍 2    🔁 0    💬 1    📌 0

Resolving LaTeX errors for uploading to arXiv...

25.10.2023 04:26 | 👍 0    🔁 0    💬 0    📌 0

GPT-4V just fixed my circuit breaker (where I had been struggling for 10+ mins at midnight)

24.10.2023 02:20 | 👍 1    🔁 0    💬 0    📌 0

Done finishing up the EMNLP Findings camera-ready

21.10.2023 02:39 | 👍 1    🔁 0    💬 0    📌 0

I finally read the attention sink paper [1] and the HF blog article [2]. Seems like another interesting data point that the models we usually interact with strongly attend to the first few tokens...
[1] arxiv.org/abs/2309.17453
[2] huggingface.co/blog/tomaars...

19.10.2023 01:18 | 👍 0    🔁 0    💬 0    📌 0
COLM 2024

New conference alert! COLM (“collum”) seeks a broad range of work on language modeling. 9 pages due Mar 8: colmweb.org

16.10.2023 16:42 | 👍 11    🔁 6    💬 0    📌 0

Preparing an EMNLP camera-ready version of our accepted paper ✍️✍️✍️

14.10.2023 22:38 | 👍 0    🔁 0    💬 0    📌 0

@akkikiki is following 20 prominent accounts