
AIME

@aime-hq.bsky.social

AIME provides GPU cloud compute and develops AI-machines for deep learning and model inference (Multi-GPU workstations & HPC servers). We are in Berlin, Germany.

415 Followers  |  1,530 Following  |  209 Posts  |  Joined: 20.10.2023  |  1.5795

Latest posts by aime-hq.bsky.social on Bluesky

GitHub - deepseek-ai/DeepSeek-V3.2-Exp Contribute to deepseek-ai/DeepSeek-V3.2-Exp development by creating an account on GitHub.

DeepSeek released V3.2, a model that combines high computational efficiency with strong reasoning and agent performance. DeepSeek reports that it surpasses GPT-5 and shows reasoning proficiency on par with Gemini-3.0-Pro.

🥇 It also achieved gold-medal performance in the 2025 IMO and IOI.

github.com/deepseek-ai/...

02.12.2025 16:42 | 👍 0    🔁 0    💬 0    📌 0
GitHub - Tongyi-MAI/Z-Image Contribute to Tongyi-MAI/Z-Image development by creating an account on GitHub.

Alibaba releases a powerful image generator, Z-Image, with 6B parameters in three variants that shows highly competitive performance against other leading models, while achieving state-of-the-art results among open-source models.

github.com/Tongyi-MAI/Z...

28.11.2025 16:13 | 👍 8    🔁 2    💬 1    📌 0
GitHub - PaddlePaddle/ERNIE: The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

🚀 Baidu just released **ERNIE-4.5-VL-28B-A3B-Thinking**, open-source (Apache 2.0)!

✅ 3B active params
✅ 100% multimodal reasoning
✅ Visual reasoning, STEM, video understanding & “Thinking with Images”
✅ Tool use, precise grounding, dynamic zoom & search

👉 github.com/PaddlePaddle...

14.11.2025 11:48 | 👍 4    🔁 0    💬 0    📌 0
MotionStream: Real-Time Video Generation with Interactive Motion Controls MotionStream is a real-time, motion-controlled video generation system that enables streaming generation of arbitrarily long videos for interactive applications.

MotionStream manipulates AI-generated videos in real time.

This is an exciting step towards a more intuitive, responsive, and creative future of AI content creation.

joonghyuk.com/motionstream...

11.11.2025 13:38 | 👍 0    🔁 0    💬 0    📌 0
GitHub - deepseek-ai/DeepSeek-OCR: Contexts Optical Compression Contexts Optical Compression. Contribute to deepseek-ai/DeepSeek-OCR development by creating an account on GitHub.

DeepSeek-OCR (new, LLM-centric, research-focused) vs. PaddleOCR (established, production-ready, multilingual).

Two different approaches to Document AI. Check them out!

github.com/deepseek-ai/...

22.10.2025 11:06 | 👍 2    🔁 0    💬 0    📌 0
ZenMux The Enterprise LLM Platform. Get a Unified API for all models, intelligent routing, and AI Model Insurance to eliminate hallucination risk.

🚀 China’s InclusionAI (Ant Group/Alibaba) drops Ling-1T, a trillion-parameter open-source LLM with only 50B parameters active per token!

✅ Beats Kimi-K2 & DeepSeek-V3
✅ Top in math (AIME’25)
✅ Efficient MoE design
✅ Strong multimodal & tool-use (~70% BFCL V3)

github.com/inclusionAI/Ling-V2

21.10.2025 13:16 | 👍 0    🔁 0    💬 0    📌 0
GitHub - SamsungSAILMontreal/TinyRecursiveModels Contribute to SamsungSAILMontreal/TinyRecursiveModels development by creating an account on GitHub.

Samsung released the Tiny Recursion Model (TRM), a parameter-efficient approach to recursive reasoning.

👉 Key insight: high-level reasoning on challenging tasks can be attained without large-scale foundation models.

github.com/SamsungSAILM...

09.10.2025 16:58 | 👍 2    🔁 0    💬 0    📌 0
Human3R: Everyone Everywhere All at Once

Human3R is a unified, feed-forward framework for online 4D human-scene reconstruction, in the world frame, from casually captured monocular videos.

fanegg.github.io/Human3R/

08.10.2025 12:35 | 👍 1    🔁 0    💬 0    📌 0
GitHub - deepseek-ai/DeepSeek-V3.2-Exp Contribute to deepseek-ai/DeepSeek-V3.2-Exp development by creating an account on GitHub.

DeepSeek AI released DeepSeek-V3.2-Exp, an experimental version of its LLM that builds on V3.1-Terminus by introducing DeepSeek Sparse Attention, a mechanism designed to explore and validate optimizations for training and inference efficiency in long-context scenarios.

github.com/deepseek-ai/...

29.09.2025 13:28 | 👍 0    🔁 0    💬 0    📌 0
GitHub - Tencent-Hunyuan/HunyuanImage-3.0: HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Tencent released HunyuanImage-3.0, a powerful native multimodal model for image generation.

The model has 80 billion parameters and is billed as the largest and most powerful open-source image-generation model currently available.

github.com/Tencent-Huny...

29.09.2025 10:46 | 👍 0    🔁 0    💬 1    📌 0
SpikingBrain Technical Report: Spiking Brain-inspired Large Models Mainstream Transformer-based large language models face major efficiency bottlenecks: training computation scales quadratically with sequence length, and inference memory grows linearly, limiting long...

The Chinese research group BICLab has announced what it describes as the world’s first “brain-like” large language model, an AI system built to consume less power, deliver higher performance, and run without relying on Nvidia hardware.

arxiv.org/abs/2509.05276

25.09.2025 16:06 | 👍 0    🔁 0    💬 0    📌 0
Wan-Animate Wan-Animate: Unified Character Animation and Replacement with Holistic Replication

Wan-Animate is a unified framework for character animation and replacement.

humanaigc.github.io/wan-animate/

25.09.2025 15:33 | 👍 0    🔁 0    💬 0    📌 0
GitHub - DecartAI/Lucy-Edit-ComfyUI Contribute to DecartAI/Lucy-Edit-ComfyUI development by creating an account on GitHub.

Lucy Edit Dev is described as the first open-source video editing model that performs instruction-guided edits on videos using free-text prompts.

github.com/DecartAI/luc...

25.09.2025 15:23 | 👍 2    🔁 1    💬 0    📌 0
GitHub - facebookresearch/map-anything: MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Meta released MapAnything: Universal Feed-Forward Metric 3D Reconstruction, a simple, end-to-end trained transformer model that directly regresses the factored metric 3D geometry of a scene given various types of inputs (images, calibration, poses, or depth).

github.com/facebookrese...

25.09.2025 15:18 | 👍 0    🔁 0    💬 0    📌 0
GitHub - facebookresearch/cwm: Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Meta released "Code World Model" (CWM), a 32-billion-parameter open-weights LLM designed to advance research on code generation with world models.

The release includes model weights, technical report, model card, and starter code.

github.com/facebookrese...

25.09.2025 15:16 | 👍 0    🔁 0    💬 0    📌 0
GitHub - QwenLM/Qwen3-Omni: Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time. ...

Alibaba released Qwen3-Omni, a natively end-to-end, multilingual, omni-modal (text, images, audio, video) foundation model that responds with real-time streaming text and natural speech and is available under an open-source license.

github.com/QwenLM/Qwen3...

23.09.2025 11:25 | 👍 1    🔁 1    💬 0    📌 0

Which GPU is best suited for AI models?
Find a brief breakdown of current GPU types, sorted by performance, in our blog article: www.aime.info/blog/en/deep...

11.09.2025 14:00 | 👍 0    🔁 0    💬 0    📌 0

Looking for an entry-level AI workstation to run LLMs locally?

👉 The AIME G500E is designed as a maintainable, efficient multi-GPU workstation with enough cooling and PSU capacity to host up to four high-end GPUs.

📺 Have a look: www.aime.info/en/shop/prod...

11.09.2025 10:00 | 👍 0    🔁 0    💬 0    📌 0

🇨🇭 EPFL, ETH Zurich, and the Swiss National Supercomputing Centre (CSCS) released Apertus, Switzerland’s first large-scale open, multilingual LLM.

Link to Paper: raw.githubusercontent.com/swiss-ai/ape...

Link to GitHub: github.com/swiss-ai/

Link to weights: huggingface.co/collections/...

11.09.2025 08:43 | 👍 0    🔁 0    💬 0    📌 0
DeepSeek AI | Leading AI Language Models & Solutions DeepSeek AI is the leading provider of advanced AI language models and enterprise solutions. Experience state-of-the-art artificial intelligence technology for your business needs.

DeepSeek V3.1 is out, advancing Artificial Intelligence.

The linked post describes it as a transformer-based architecture with 560 billion parameters and a 1 million token context window, with multimodal capabilities that include text, code, and image understanding and support for over 100 languages.

deepseek.ai/blog/deepsee...

03.09.2025 16:10 | 👍 0    🔁 0    💬 0    📌 0
Chat with Z.ai - Free AI Chatbot powered by GLM-4.5 Start a free chat with your AI expert for code and smart tools. Tell Z.ai what you need (a complete full-stack application, a stunning presentation, or professional-grade writing) and get instant result...

The Chinese company Z.ai released its models GLM-4.1V-Thinking and GLM-4.5V: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

The models are vLLM- and SGLang-ready!

github.com/zai-org/GLM-V/

03.09.2025 16:09 | 👍 0    🔁 0    💬 0    📌 0
GitHub - QwenLM/Qwen-Image: Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Alibaba Cloud released Qwen Image, a 20B MMDiT image foundation model that achieves significant advances in complex text rendering and precise image editing.

The model is now natively supported in ComfyUI.

It’s said to outperform FLUX and comparable models.

github.com/QwenLM/Qwen-...

03.09.2025 16:08 | 👍 0    🔁 0    💬 0    📌 0
AIME G500 - Multi GPU Workstation | AIME AIME G500 - Workstation The AIME G500 is designed as a maintenance-friendly high-end GPU workstation, with outstanding cooling performance and power supply capacity to run up to four high-end GPUs...

The AIME G500 is now even more powerful, supporting the Threadripper Pro 99xxWX CPUs!

www.aime.info/de/shop/prod...

01.08.2025 16:25 | 👍 0    🔁 0    💬 0    📌 0
Chat with Z.ai - Free AI for Presentations, Writing & Coding Start a free chat with your AI assistant. Tell Z.ai what you need (a stunning presentation, professional-grade writing, or a complex code script) and get instant results.

The Chinese company Z.ai released its GLM-4.5 model series as open source. The models in the series are foundation models designed for intelligent agents.

👉 GLM-4.5: 355B total / 32B active parameters

👉 GLM-4.5-Air: 106B total / 12B active parameters

github.com/zai-org/GLM-...

29.07.2025 12:39 | 👍 0    🔁 0    💬 0    📌 0
GitHub - MoonshotAI/Kimi-K2: Kimi K2 is the large language model series developed by Moonshot AI team

The Chinese company Moonshot AI released Kimi K2, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters.

github.com/MoonshotAI/K...

15.07.2025 09:08 | 👍 0    🔁 0    💬 0    📌 0
GitHub - wasserth/TotalSegmentator: Tool for robust segmentation of >100 important anatomical structures in CT and MR images

TotalSegmentator is a tool for segmenting most major anatomical structures in any CT or MR image, created by the Department of Research and Analysis at University Hospital Basel.

github.com/wasserth/Tot...

01.07.2025 09:04 | 👍 0    🔁 0    💬 0    📌 0
Black Forest Labs - Frontier AI Lab Amazing AI models from the Black Forest.

Black Forest Labs released FLUX.1 Kontext [dev], which delivers proprietary-level image editing performance in a 12B parameter model that can run on consumer hardware.

bfl.ai/announcement...

27.06.2025 09:06 | 👍 0    🔁 0    💬 0    📌 0
Reader API Convert any URL to Markdown for better grounding LLMs.

ReaderLM-v2 is a 1.5B-parameter language model specialized for HTML-to-Markdown conversion and HTML-to-JSON extraction.

jina.ai/reader

26.06.2025 09:58 | 👍 1    🔁 0    💬 0    📌 0
Jina Embeddings v4: Universal Embeddings for Multimodal Multilingual Retrieval Jina Embeddings v4 is a 3.8 billion parameter universal embedding model for multimodal and multilingual retrieval that supports both single-vector and multi-vector embedding outputs.

Jina AI released Jina Embeddings v4, a 3.8 billion-parameter multimodal and multilingual embedding model based on the Qwen 2.5 VL 3B Instruct backbone. The architecture combines text and images in a unified semantic space.

jina.ai/news/jina-em...

26.06.2025 09:47 | 👍 2    🔁 0    💬 0    📌 0
GitHub - Tencent-Hunyuan/Hunyuan3D-2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Tencent Hunyuan3D-2.1 is a scalable 3D asset creation system that advances state-of-the-art 3D generation through two pivotal innovations: a fully open-source framework and physically-based rendering (PBR) texture synthesis.

github.com/Tencent-Huny...

24.06.2025 10:11 | 👍 0    🔁 0    💬 0    📌 0

@aime-hq is following 20 prominent accounts