ethicalabs.ai

@ethicalabs.bsky.social

Building practical, ethical and sustainable AI/ML solutions https://www.ethicalabs.ai/

17 Followers  |  42 Following  |  10 Posts  |  Joined: 14.02.2025

Latest posts by ethicalabs.bsky.social on Bluesky

This paper is making the rounds: arxiv.org/abs/2506.21734

A tiny (27M-parameter), brain-inspired model trained on just 1,000 samples outperforms o3-mini-high on reasoning tasks.

#MLSky 🧠🤖

03.08.2025 02:01 - 👍 128    🔁 27    💬 4    📌 1
GitHub - ethicalabs-ai/completionist: Command-line tools for Synthetic Datasets Generation

Introducing Completionist, an open-source command-line tool that automates synthetic text dataset generation.

👉 Check out Completionist on #GitHub: github.com/ethicalabs-a...

#LLMs #GenerativeAI #DataEngineering #FineTuning #OpenSource #Python #SyntheticData #RAG

02.08.2025 12:58 - 👍 3    🔁 1    💬 0    📌 0
Building and Sharing a Multimodal ViT Model for Skin Lesion Analysis: From Proof of Concept to Hugging Face App 🤗

Building and Sharing a Multimodal ViT Model for Skin Lesion Analysis: From Proof of Concept to Hugging Face App 🤗 #huggingface #MLSky #opensource #python #vit #transformers #medicalai #visionmodel #skincancer

medium.com/@massimo.sca...

02.08.2025 12:36 - 👍 2    🔁 1    💬 0    📌 0
Kurtis-E1.1: Supervised Fine-tuning of Qwen2.5-3B-Instruct with Flower.ai & Hugging Face A Blog post by Massimo Roberto Scamarcia on Hugging Face

Rather than chasing benchmark supremacy or scaling wars, Kurtis E1.1 focuses on understanding, sustainability, and practical impact, especially in areas like mental health support and safer human-AI interaction.

huggingface.co/blog/mrs83/k...

#MLSky #EthicalAI #LLM

02.04.2025 17:49 - 👍 2    🔁 0    💬 0    📌 0

To developers: Build opt-in systems.
To policymakers: Legislate data transparency.
To artists: Unionize.
To users: Demand ethical tools.

#EthicalAI #MLSky

30.03.2025 18:28 - 👍 1    🔁 0    💬 0    📌 0
Kurtis-E1-MLX-Voice-Agent
YouTube video by Massimo Scamarcia

Just built an offline voice assistant for macOS:
🎤 Whisper STT (MLX)
🧠 LLM via #Ollama
🗣️ XTTSv2 TTS
🌍 Optional translation
No cloud. No tracking. No vibe coding, all handcrafted.
Demo here 🎥 www.youtube.com/watch?v=8-1P...

#OnDeviceAI #LLM #Privacy #TTS #STT

24.03.2025 23:52 - 👍 0    🔁 0    💬 1    📌 0
Testing Kurtis E1: Can a Small, Fine-Tuned Model Generalize Beyond Its Training Scope?

Testing Kurtis E1 beyond its training scope: AI ethics, decentralization, even philosophy. No hallucinations, just structured reasoning. Is this emergent? You decide.

Read more: medium.com/@massimo.sca... #LLM #AI #EthicalAI

05.03.2025 23:39 - 👍 1    🔁 0    💬 0    📌 0
AI can now model and design the genetic code for all domains of life with Evo 2 | Arc Institute Arc Institute develops the largest AI model for biology to date in collaboration with NVIDIA, bringing together Stanford University, UC Berkeley, and UC San Francisco researchers

🧪 The Arc Institute's Evo 2 models DNA like an LLM models language, predicting mutations, gene function, and evolutionary signals. With 40B parameters trained on 128K genomes, it hints at AI-driven biological discovery. 🧬💻 #MLSky
Link to the paper: https://arcinstitute.org/manuscripts/Evo2

24.02.2025 15:15 - 👍 14    🔁 7    💬 0    📌 1
🌀 Ouroboros: Recursive LLM Refinement with Small Models. From Synthetic Data to Recursive Self-Refinement: An On-Device Experiment

🌀 Ouroboros: Small models drive recursive #LLM self-refinement for synthetic dataset generation. On-device AI shaping smarter futures! #EdgeAI #OpenSourceWeek #DeepSeekR1 #Ollama medium.com/@massimo.sca...

22.02.2025 21:37 - 👍 1    🔁 0    💬 0    📌 0
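
The Ouroboros post above describes recursive self-refinement only in outline. As a rough sketch of what such a loop can look like (this is not the Ouroboros code): a draft is critiqued, revised with the feedback, and re-critiqued until it clears a quality bar or a round limit is reached. The names `refine_recursively`, `toy_draft`, `toy_critique`, and `toy_revise` are hypothetical, and the toy functions stand in for calls to a small local model.

```python
from typing import Callable, Tuple

Critique = Tuple[float, str]  # (quality score in [0, 1], textual feedback)


def refine_recursively(
    prompt: str,
    draft: Callable[[str], str],
    critique: Callable[[str, str], Critique],
    revise: Callable[[str, str, str], str],
    max_rounds: int = 3,
    target: float = 0.9,
) -> Tuple[str, int]:
    """Return the final answer and the number of revisions applied."""
    answer = draft(prompt)
    for round_index in range(max_rounds):
        score, feedback = critique(prompt, answer)
        if score >= target:
            # Good enough: stop before spending another revision.
            return answer, round_index
        answer = revise(prompt, answer, feedback)
    return answer, max_rounds


# Toy stand-ins (assumptions for illustration, not model calls).
def toy_draft(prompt: str) -> str:
    return "short"


def toy_critique(prompt: str, answer: str) -> Critique:
    # Hypothetical quality signal: longer answers score higher, capped at 1.0.
    score = min(1.0, len(answer.split()) / 4)
    return score, "add detail"


def toy_revise(prompt: str, answer: str, feedback: str) -> str:
    return f"{answer} detail"


if __name__ == "__main__":
    print(refine_recursively("q", toy_draft, toy_critique, toy_revise, max_rounds=5))
```

Capping the number of rounds keeps the loop bounded even when the critic never reports a passing score, which matters on-device where each model call is expensive.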

🚀 NVIDIA Minitron: Efficient LLM Compression!

arxiv.org/pdf/2408.11796

Minitron uses pruning + distillation to create smaller, high-performance models.

🔑 Highlights:
- Teacher Correction: Adapts models to new data
- Structured #Pruning: 2.7x faster inference
- #Distillation: Uses 40x fewer tokens

19.02.2025 21:07 - 👍 1    🔁 1    💬 0    📌 0
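
To make the two ingredients named in the post above concrete, here is a minimal, standard-library sketch of structured pruning and a distillation loss. It is not the Minitron recipe: row importance is approximated here by the L2 norm of the weights as a stand-in for the paper's importance estimation, and the loss is a plain temperature-softened KL divergence. The function names are hypothetical.

```python
import math
from typing import List


def log_softmax(logits: List[float], temperature: float = 1.0) -> List[float]:
    """Numerically stable log-softmax over temperature-scaled logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_total for x in scaled]


def distillation_loss(
    teacher_logits: List[float],
    student_logits: List[float],
    temperature: float = 2.0,
) -> float:
    """KL(teacher || student) between temperature-softened distributions."""
    log_p = log_softmax(teacher_logits, temperature)
    log_q = log_softmax(student_logits, temperature)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))


def prune_rows(weights: List[List[float]], keep: int) -> List[List[float]]:
    """Structured pruning: drop whole rows (neurons), keeping the `keep`
    rows with the largest L2 norm in their original order."""
    norms = [math.sqrt(sum(w * w for w in row)) for row in weights]
    ranked = sorted(range(len(weights)), key=lambda i: norms[i], reverse=True)
    kept = sorted(ranked[:keep])
    return [weights[i] for i in kept]


if __name__ == "__main__":
    layer = [[0.1, 0.1], [3.0, 4.0], [1.0, 0.0]]
    print(prune_rows(layer, keep=2))
    print(distillation_loss([1.0, 2.0, 3.0], [3.0, 2.0, 1.0]))
```

Removing whole rows, rather than zeroing individual weights, is what shrinks the matrix shapes and lets a pruned model run faster on ordinary dense hardware. The distillation loss then trains the smaller student to match the teacher's output distribution.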
OpenAI scrubs diversity commitment web page from its site
OpenAI has eliminated a page on its website that used to express its commitment to diversity, equity, and inclusion. The URL “https://openai.com/commitment-to-dei/” now redirects to “https://openai.com/building-dynamic-teams/,” a page that talks about people with “different backgrounds” with no use of the word “diversity.” The previous page stated that the company’s “investment in diversity, equity and inclusion” […]

#ai #news #openai

14.02.2025 19:42 - 👍 2    🔁 2    💬 0    📌 0
This Nashville Startup Is Protecting Kids Online with Smarter, Safer AI - Hypepotamus Nashville-based Angel Q (previously known as Angel Kids AI) got its start building a safer browser option for kids to access the internet. But browsers are not the only way of searching for […]

Arcee AI and AngelQ just launched KidRails for #LLMs, an open-source framework for safe, age-appropriate #AI responses for children.

hypepotamus.com/startup-news...

Setting new standards in security, transparency, and responsibility 🌸 #EthicalAI #ML

14.02.2025 19:44 - 👍 1    🔁 0    💬 0    📌 4
They Said It Couldn’t Be Done A Blog post by Pierre-Carl Langlais on Hugging Face

Pleias is a large language model trained exclusively on open data. It was developed using the Common Corpus, a dataset that addresses the need for high-quality compliant training data in AI development. huggingface.co/blog/Pclangl...

#opensourcellm #opendata #commoncorpus #llm #ai #ml

14.02.2025 19:39 - 👍 3    🔁 0    💬 0    📌 0
Active Inheritance, A Smarter Way to Train Models with Synthetic Data The practice of fine-tuning models on synthetic data is becoming well established. But synthetic training data, even if it represents the training...

A naive way to generate synthetic fine-tuning data is to feed prompts to a model, collect its output, and use that as the fine-tuning set. But synthetic data is cheap, so we can afford to be choosier. By generating several responses to each prompt, we can select the one that best suits our purposes. #AI #ML

06.02.2025 12:53 - 👍 9    🔁 1    💬 0    📌 0
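
The selection idea in the post above (sample several responses per prompt, keep the best one) can be sketched in a few lines. This is a minimal illustration and not the method from the linked article: `select_best`, `build_dataset`, `toy_generate`, and `toy_score` are hypothetical names, and in practice the generator would be a model call and the scorer a metric or reward model chosen for the target behaviour.

```python
from typing import Callable, Dict, List

Generator = Callable[[str, int], List[str]]  # (prompt, n) -> n candidate responses
Scorer = Callable[[str, str], float]         # (prompt, response) -> higher is better


def select_best(prompt: str, generate: Generator, score: Scorer, n: int = 4) -> str:
    """Sample n candidates for one prompt and return the highest-scoring one."""
    candidates = generate(prompt, n)
    if not candidates:
        raise ValueError("generator returned no candidates")
    return max(candidates, key=lambda response: score(prompt, response))


def build_dataset(
    prompts: List[str], generate: Generator, score: Scorer, n: int = 4
) -> List[Dict[str, str]]:
    """Build a fine-tuning set with one selected response per prompt."""
    return [
        {"prompt": p, "response": select_best(p, generate, score, n)}
        for p in prompts
    ]


# Toy stand-ins (assumptions for illustration, not model calls).
def toy_generate(prompt: str, n: int) -> List[str]:
    return [f"{prompt} -> draft {i}" + "!" * i for i in range(n)]


def toy_score(prompt: str, response: str) -> float:
    # Hypothetical preference: fewer exclamation marks is better.
    return -float(response.count("!"))


if __name__ == "__main__":
    print(build_dataset(["Explain pruning"], toy_generate, toy_score, n=3))
```

The scorer is where the steering happens: swapping in a different metric changes which responses make it into the fine-tuning set without touching the generator.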

@ethicalabs is following 19 prominent accounts