Baidu just released a reasoning model 🔥 ERNIE-4.5-21B-A3B-Thinking
huggingface.co/baidu/ERNIE-...
✨ Small MoE - Apache 2.0
✨ 128K context length for deep reasoning
✨ Efficient tool usage capabilities
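A quick way to try it is a standard transformers load; a minimal sketch below, assuming the repo id matches the post title (the link above is truncated) and that the release ships a chat template, as similar drops do.

```python
# Minimal sketch: run ERNIE-4.5-21B-A3B-Thinking with transformers.
# Assumptions: repo id inferred from the post title; chat template present;
# enough GPU memory for a 21B-parameter MoE in bf16.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "baidu/ERNIE-4.5-21B-A3B-Thinking"  # inferred; link above is truncated
tok = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True
)

messages = [{"role": "user", "content": "What is 17 * 24? Think it through."}]
inputs = tok.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
out = model.generate(inputs, max_new_tokens=512)
print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```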
MiniCPM4.1 🔥 New edge-side LLM built for efficiency + reasoning, from OpenBMB
huggingface.co/openbmb/Mini...
✨ 8B - Apache 2.0
✨ Hybrid reasoning model: deep reasoning + fast inference
✨ 5x faster on edge chips, 90% smaller (BitCPM)
✨ Trained on UltraClean + UltraChat v2 data
Inverse IFEval 🔥 New benchmark from ByteDance & M-A-P
huggingface.co/datasets/m-a...
huggingface.co/papers/2509....
Testing LLMs on their ability to override trained-in biases & follow adversarial instructions.
✨ 8 challenge types
✨ 1,012 Chinese/English questions across 23 domains
✨ Human-in-the-loop + LLM-as-a-Judge evaluation
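To poke at the benchmark yourself, the standard datasets one-liner applies; the dataset link above is truncated, so the repo id below is a placeholder and the schema is whatever the card defines.

```python
# Sketch: load the Inverse IFEval benchmark with the datasets library.
# The repo id is a placeholder (the link above is truncated); take the real
# id, split names, and fields from the dataset card.
from datasets import load_dataset

ds = load_dataset("m-a-p/Inverse-IFEval-placeholder", split="train")  # hypothetical id
print(ds)     # features and row count
print(ds[0])  # one adversarial-instruction item
```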
Added the Kwai team from Kuaishou to the heatmap
huggingface.co/spaces/zh-ai...
Klear-46B-A2.5 🔥 a sparse MoE LLM developed by the Kwai-Klear Team at Kuaishou
huggingface.co/collections/...
✨ 46B total / 2.5B active - Apache 2.0
✨ Dense-level performance at lower cost
✨ Trained on 22T tokens with a progressive curriculum
✨ 64K context length
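A back-of-the-envelope reading of the 46B total / 2.5B active line: every expert has to sit in memory, but each token only pays the compute of the active slice. The numbers below come straight from the post.

```python
# MoE memory vs compute for Klear-46B-A2.5, using the post's figures.
total_params = 46e9    # all experts must be resident in memory
active_params = 2.5e9  # parameters actually used per token
bytes_per_param = 2    # bf16 weights

print(f"weights in memory: ~{total_params * bytes_per_param / 1e9:.0f} GB")
print(f"per-token compute: ~{active_params / total_params:.1%} of a dense 46B pass")
```

That ratio (about 5%) is where the "dense-level performance at lower cost" claim comes from: serving compute scales with active, not total, parameters.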
Latest update from Moonshot AI
Kimi K2 >>> Kimi K2-Instruct-0905 🔥
huggingface.co/moonshotai/K...
✨ 32B activated / 1T total parameters
✨ Enhanced agentic coding intelligence
✨ Better frontend coding experience
✨ 256K context window for long-horizon tasks
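On the agentic side, transformers chat templates can inject tool schemas straight into the prompt; a hedged sketch below, where run_tests is a made-up tool and the template details are whatever the moonshotai model card specifies (the 1T-parameter model itself needs serious hardware to actually run).

```python
# Sketch: render a tool-calling prompt for Kimi-K2-Instruct-0905.
# The repo id is inferred from the post title and run_tests is hypothetical;
# follow the official model card for real agentic setups.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained(
    "moonshotai/Kimi-K2-Instruct-0905", trust_remote_code=True
)

def run_tests(path: str) -> str:
    """Run the test suite under a directory and return a summary.

    Args:
        path: Directory whose tests should be run.
    """
    ...

messages = [{"role": "user", "content": "Fix the failing test in utils/ and rerun."}]
prompt = tok.apply_chat_template(
    messages, tools=[run_tests], add_generation_prompt=True, tokenize=False
)
print(prompt)  # inspect how the tool schema lands in the context window
```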
Hunyuan-MT-7B 🔥 open translation model released by Tencent
huggingface.co/collections/...
✨ Supports 33 languages, including 5 ethnic minority languages in China
✨ Includes a translation ensemble model: Chimera-7B
✨ Full pipeline: pretrain > CPT > SFT > enhancement > ensemble refinement > SOTA performance at similar scale
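For a quick taste, a translation model this size runs through the plain text-generation pipeline; the repo id below is inferred from the title (the collection link above is truncated), and the exact prompt format the model expects should be checked against the model card.

```python
# Sketch: one translation call to Hunyuan-MT-7B via the transformers pipeline.
# Repo id inferred from the post title; the model card's prompt format may
# differ from this plain instruction.
from transformers import pipeline

pipe = pipeline("text-generation", model="tencent/Hunyuan-MT-7B", device_map="auto")
msgs = [{"role": "user", "content": "Translate into English: 今天天气真好。"}]
out = pipe(msgs, max_new_tokens=128)
print(out[0]["generated_text"][-1]["content"])  # the assistant's translation
```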
From food delivery to frontier AI: Meituan, the leading lifestyle platform, just dropped its first open SoTA LLM, LongCat-Flash 🔥
huggingface.co/meituan-long...
✨ 560B total / ~27B active MoE - MIT license
✨ 128K context length + advanced reasoning
✨ ScMoE design: 100+ TPS inference
✨ Stable large-scale training + strong agentic performance
USO 🎨 Unified customization model released by ByteDance Research
Demo
huggingface.co/spaces/byted...
Model
huggingface.co/bytedance-re...
Paper
huggingface.co/papers/2508....
✨ Large-scale triplet dataset (content, style, stylized)
✨ Disentangled learning: style alignment + content preservation
✨ Style Reward Learning (SRL) for higher fidelity
✨ USO-Bench: first benchmark evaluating style & subject jointly
✨ SOTA results on subject consistency & style similarity
Step-Audio 2 🔥 New end-to-end multimodal LLM for audio & speech, released by StepFun
huggingface.co/collections/...
✨ Works on raw audio directly, text & speech in and out: no ASR+LLM+TTS pipeline
✨ High-IQ reasoning: RL + CoT for paralinguistic cues
✨ Multimodal RAG + tool calling
✨ Emotion, timbre, dialect & style control
✨ SOTA on ASR, paralinguistics & speech dialog
🇨🇳 China's State Council just released its "AI+" Action Plan (2025)
huggingface.co/spaces/zh-ai...
✨ Goal: by 2035, AI will deeply empower all sectors and reshape productivity & society
✨ Focus on 6 pillars:
>Science & Tech
>Industry
>Consumption
>Public welfare
>Governance
>Global cooperation
✨ Highlights:
>Models: advance theory, efficient training/inference, evaluation systems
>Data: high-quality datasets, IP/copyright reform, new incentives
>Compute: boost chips & clusters, improve the national network, promote cloud standardization, and ensure inclusive, efficient, green, secure supply
>Applications: AI-as-a-service, test bases, new standards
>Open-source: support communities, encourage contributions (incl. university credits & recognition), foster new application approaches, and build globally impactful ecosystems
>Talent, policy & safety frameworks: secure sustainable growth
MiniCPM-V 4.5 🔥 New MLLM for image, multi-image & video understanding, running even on your phone, released by OpenBMB
huggingface.co/openbmb/Mini...
✨ SOTA vision-language capability
✨ 96× video token compression > high-FPS & long video reasoning
✨ Switchable fast vs deep thinking modes
✨ Strong OCR, document parsing, supports 30+ languages
InternVL3.5 🔥 New family of multimodal models by Shanghai AI Lab @opengvlab
huggingface.co/collections/...
✨ 1B · 2B · 4B · 8B · 14B · 38B ｜ MoE: 20B-A4B · 30B-A3B · 241B-A28B - Apache 2.0
✨ +16% reasoning performance, 4.05× speedup vs InternVL3
Intern-S1-mini 🔥 lightweight open multimodal reasoning model by Shanghai AI Lab
huggingface.co/internlm/Int...
✨ Efficient 8B LLM + 0.3B vision encoder
✨ Apache 2.0
✨ 5T-token multimodal pretraining, 50%+ from scientific domains
✨ Dynamic tokenizer for molecules & protein sequences
Seed-OSS 🔥 The latest open LLM from ByteDance's Seed team
huggingface.co/collections/...
✨ 36B - Base & Instruct
✨ Apache 2.0
✨ Native 512K long context
✨ Strong reasoning & agentic intelligence
✨ 2 Base versions: with & without synthetic data
✨ DeepSeek V3.1 just dropped on @hf.co
huggingface.co/collections/...
Before my vacation: Qwen releasing.
When I came back: Qwen still releasing.
Respect!! 🫡
Qwen Image Edit 🔥 the image editing version of Qwen-Image by Alibaba Qwen
huggingface.co/Qwen/Qwen-Im...
huggingface.co/spaces/Qwen/...
✨ Apache 2.0
✨ Semantic + Appearance Editing: rotate, restyle, add/remove
✨ Precise Text Editing: edit CN/EN text, keep style
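For completeness, here is what an editing call could look like through diffusers; QwenImageEditPipeline and its arguments are my assumption for how the release is integrated, so verify the class name and signature on the model card first.

```python
# Sketch: image editing with Qwen-Image-Edit via diffusers.
# ASSUMPTION: the release is exposed as QwenImageEditPipeline; check the
# model card for the real pipeline class and call signature.
import torch
from diffusers import QwenImageEditPipeline
from diffusers.utils import load_image

pipe = QwenImageEditPipeline.from_pretrained(
    "Qwen/Qwen-Image-Edit", torch_dtype=torch.bfloat16
).to("cuda")

image = load_image("product_photo.png")  # any local or remote image
edited = pipe(
    image=image,
    prompt="Replace the Chinese label text with 'SALE', keep the original font style",
).images[0]
edited.save("edited.png")
```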
🔥 July highlights from the Chinese AI community
huggingface.co/collections/...
I've been tracking things closely, but July's open-source wave still managed to surprise me.
Can't wait to see what's coming next!
Qwen team did it again! 🔥
They just released Qwen3-Coder-30B-A3B-Instruct on the Hub
huggingface.co/Qwen/Qwen3-C...
✨ Apache 2.0
✨ 30B total / 3.3B active (128 experts, 8 top-k)
✨ Native 256K context, extendable to 1M via YaRN
✨ Built for agentic coding
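On the 256K-native, 1M-via-YaRN point: with transformers this is usually a rope_scaling override at load time. A sketch below, with the factor chosen so that 4 × 262,144 ≈ 1M; treat the exact keys and values as per the model card.

```python
# Sketch: stretch Qwen3-Coder's native 256K window toward 1M with YaRN.
# The rope_scaling recipe follows the usual Qwen model-card pattern; confirm
# the exact values there before relying on it.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-Coder-30B-A3B-Instruct",
    device_map="auto",
    rope_scaling={
        "rope_type": "yarn",
        "factor": 4.0,                               # 4 x 262144 ~= 1M tokens
        "original_max_position_embeddings": 262144,  # native 256K window
    },
)
```

Note that Qwen's docs generally recommend enabling YaRN only when prompts actually exceed the native window, since static scaling can degrade short-context quality.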
It's here! After the WAIC announcement, StepFun has just dropped Step 3 🔥, their latest multimodal reasoning model, on the hub.
Model: huggingface.co/stepfun-ai/s...
Paper: huggingface.co/papers/2507....
✨ 321B total / 32B active - Apache 2.0
✨ MFA + AFD: cutting decoding cost by up to 70% vs. DeepSeek-V3
✨ 4T image-text pretraining: strong vision-language grounding
✨ Modular, efficient, deployable: runs on just 8×48GB GPUs