GitHub - deepseek-ai/DeepSeek-V3.2-Exp
DeepSeek released V3.2, a model that combines high computational efficiency with superior reasoning and agent performance. It surpasses GPT-5 and shows reasoning proficiency on par with Gemini-3.0-Pro.
It also achieved gold-medal performance in the 2025 IMO and IOI.
github.com/deepseek-ai/...
02.12.2025 16:42
GitHub - Tongyi-MAI/Z-Image
Alibaba released Z-Image, a powerful 6B-parameter image generator in three variants. It shows highly competitive performance against other leading models and achieves state-of-the-art results among open-source models.
github.com/Tongyi-MAI/Z...
28.11.2025 16:13
GitHub - PaddlePaddle/ERNIE: The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
Baidu just released **ERNIE-4.5-VL-28B-A3B-Thinking**, open-source (Apache 2.0)!
✅ 3B active params
✅ 100% multimodal reasoning
✅ Visual reasoning, STEM, video understanding & "Thinking with Images"
✅ Tool use, precise grounding, dynamic zoom & search
github.com/PaddlePaddle...
14.11.2025 11:48
GitHub - deepseek-ai/DeepSeek-OCR: Contexts Optical Compression
DeepSeek-OCR (new, LLM-centric, research-focused) vs. PaddleOCR (established, production-ready, multilingual).
Two different approaches to Document AI. Check them out!
github.com/deepseek-ai/...
22.10.2025 11:06
ZenMux
The Enterprise LLM Platform. Get a Unified API for all models, intelligent routing, and AI Model Insurance to eliminate hallucination risk.
China's InclusionAI (Ant Group/Alibaba) drops Ling-1T, a trillion-parameter open-source LLM with only 50B active per token!
✅ Beats Kimi-K2 & DeepSeek-V3
✅ Top in math (AIME'25)
✅ Efficient MoE design
✅ Strong multimodal & tool-use (~70% BFCL V3)
github.com/inclusionAI/Ling-V2
21.10.2025 13:16
GitHub - SamsungSAILMontreal/TinyRecursiveModels
Samsung released the Tiny Recursion Model (TRM), a parameter-efficient approach to recursive reasoning.
Key insight: it demonstrates that high-level reasoning on challenging tasks can be attained without large-scale foundation models.
github.com/SamsungSAILM...
09.10.2025 16:58
Human3R: Everyone Everywhere All at Once
Human3R is a unified, feed-forward framework for online 4D human-scene reconstruction, in the world frame, from casually captured monocular videos.
fanegg.github.io/Human3R/
08.10.2025 12:35
GitHub - deepseek-ai/DeepSeek-V3.2-Exp
DeepSeek AI released DeepSeek-V3.2-Exp, an experimental version of its LLM. It builds on V3.1-Terminus by introducing DeepSeek Sparse Attention, which is designed to explore and validate optimizations for training and inference efficiency in long-context scenarios.
github.com/deepseek-ai/...
29.09.2025 13:28
GitHub - Tencent-Hunyuan/HunyuanImage-3.0: HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Tencent released HunyuanImage-3.0, a powerful native multimodal model for image generation.
The model has 80 billion parameters and is currently the largest and most powerful open-source image-generation model available.
github.com/Tencent-Huny...
29.09.2025 10:46
SpikingBrain Technical Report: Spiking Brain-inspired Large Models
Mainstream Transformer-based large language models face major efficiency bottlenecks: training computation scales quadratically with sequence length, and inference memory grows linearly, limiting long...
The Chinese research group BICLab has announced what it describes as the world's first "brain-like" large language model, an AI system built to consume less power, deliver higher performance, and run without relying on Nvidia hardware.
arxiv.org/abs/2509.05276
25.09.2025 16:06
Wan-Animate
Wan-Animate: Unified Character Animation and Replacement with Holistic Replication
Wan-Animate is a unified framework for character animation and replacement.
humanaigc.github.io/wan-animate/
25.09.2025 15:33
GitHub - DecartAI/Lucy-Edit-ComfyUI
Lucy Edit Dev is the first open-source instruction-guided video editing model; it edits videos based on free-text prompts.
github.com/DecartAI/luc...
25.09.2025 15:23
GitHub - facebookresearch/map-anything: MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Meta released MapAnything: Universal Feed-Forward Metric 3D Reconstruction, a simple, end-to-end trained transformer model that directly regresses the factored metric 3D geometry of a scene given various types of inputs (images, calibration, poses, or depth).
github.com/facebookrese...
25.09.2025 15:18
GitHub - QwenLM/Qwen3-Omni: Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generat...
Alibaba released Qwen3-Omni, a natively end-to-end, multilingual, omni-modal (text, images, audio, video) foundation model that responds as a real-time stream in both text and natural speech. It is available under an open-source license.
github.com/QwenLM/Qwen3...
23.09.2025 11:25
Which GPU is best suited for AI models?
Find a brief breakdown of current GPU types, sorted by performance, in our blog article: www.aime.info/blog/en/deep...
11.09.2025 14:00
Looking for an entry-level AI workstation to run LLMs locally?
The AIME G500E is designed as a maintainable, efficient multi-GPU workstation with enough cooling and PSU capacity to host up to four high-end GPUs.
Have a look: www.aime.info/en/shop/prod...
11.09.2025 10:00
EPFL, ETH Zurich, & the Swiss National Supercomputing Centre (CSCS) released Apertus, Switzerland's first large-scale open, multilingual LLM.
Link to Paper: raw.githubusercontent.com/swiss-ai/ape...
Link to GitHub: github.com/swiss-ai/
Link to weights: huggingface.co/collections/...
11.09.2025 08:43
DeepSeek AI | Leading AI Language Models & Solutions
DeepSeek AI is the leading provider of advanced AI language models and enterprise solutions. Experience state-of-the-art artificial intelligence technology for your business needs.
DeepSeek V3.1 is out, advancing Artificial Intelligence.
It is a transformer-based architecture with 560 billion parameters and a 1 million token context window. Its multimodal capabilities include text, code, and image understanding, and it supports over 100 languages.
deepseek.ai/blog/deepsee...
03.09.2025 16:10
Chat with Z.ai - Free AI Chatbot powered by GLM-4.5
Start a free chat with your AI expert for code and smart tools. Tell Z.ai what you need (a complete full-stack application, a stunning presentation, or professional-grade writing) and get instant result...
The Chinese company Z.ai released its models GLM-4.1V-Thinking and GLM-4.5V: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
The models are vLLM- and SGLang-ready!
github.com/zai-org/GLM-V/
03.09.2025 16:09
Chat with Z.ai - Free AI for Presentations, Writing & Coding
Start a free chat with your AI assistant. Tell Z.ai what you need (a stunning presentation, professional-grade writing, or a complex code script) and get instant results.
The Chinese company Z.ai released GLM-4.5 as open source, a series of foundation models designed for intelligent agents.
GLM-4.5: 355B total / 32B active parameters
GLM-4.5-Air: 106B total / 12B active parameters
github.com/zai-org/GLM-...
29.07.2025 12:39
GitHub - MoonshotAI/Kimi-K2: Kimi K2 is the large language model series developed by Moonshot AI team
The Chinese company Moonshot AI released Kimi K2, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters.
github.com/MoonshotAI/K...
15.07.2025 09:08
Black Forest Labs - Frontier AI Lab
Amazing AI models from the Black Forest.
Black Forest Labs released FLUX.1 Kontext [dev], which delivers proprietary-level image editing performance in a 12B-parameter model that can run on consumer hardware.
bfl.ai/announcement...
27.06.2025 09:06
Reader API
Convert any URL to Markdown for better grounding LLMs.
ReaderLM-v2 is a 1.5B-parameter language model specialized for HTML-to-Markdown conversion and HTML-to-JSON extraction.
jina.ai/reader
26.06.2025 09:58
GitHub - Tencent-Hunyuan/Hunyuan3D-2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Tencent Hunyuan3D-2.1 is a scalable 3D asset creation system that advances state-of-the-art 3D generation through two pivotal innovations: Fully Open-Source Framework and Physically-Based Rendering (PBR) Texture Synthesis.
github.com/Tencent-Huny...
24.06.2025 10:11