we heard you hate writing boilerplate code
so we built something...
> open gradio sketch
> select and add components
> configure visually
> get perfect python code ๐คฏ
Building AI apps will never be the same ๐ฅ
Coming very soon ๐
@mathiasesn1.bsky.social
๐ข Senior Machine Learning Engineer @ https://mediacatch.io/ ๐ป https://grandaiwizard.com/ ๐จโ๐ป https://github.com/mathiasesn ๐ https://www.linkedin.com/in/mathias-emil-slettemark-nielsen-bab596150/ ๐ค https://huggingface.co/mathiasn1
we heard you hate writing boilerplate code
so we built something...
> open gradio sketch
> select and add components
> configure visually
> get perfect python code ๐คฏ
Building AI apps will never be the same ๐ฅ
Coming very soon ๐
[dk] Ok, nu begynder det at blive dumt, det her..
02.02.2025 16:58 โ ๐ 26 ๐ 2 ๐ฌ 2 ๐ 0Weโre building a new static type checker for Python, from scratch, in Rust.
From a technical perspective, itโs probably our most ambitious project yet. Weโre about 800 PRs deep!
๐คฏ Vector search on top of millions of docs in seconds. no pre-indexing!
Model2Vec is an embedding powerhouse that distils good models and makes them up by 500x faster and 15x smaller.
Vector Search on Hub Datasets demo: https://buff.ly/4gYhVlY
Library: https://buff.ly/42miwte
The image shows an illustration titled "Hygge Web Data" featuring three cartoon animals - a fox, an owl, and what appears to be a bear or similar animal - sitting at a table or surface reviewing various documents and papers. The style is cute and whimsical, with the animals drawn in a simple, friendly manner. Each animal is looking at different papers with sketched symbols, text, and designs on them. The illustration has a gentle, cozy feel to it, fitting with the "hygge" (Danish concept of coziness and comfort) mentioned in the title.
Introducing Scandi-fine-web-cleaner, a decoder model trained to remove low-quality web from FineWeb 2 for Danish and Swedish
- Uses FineWeb-c community annotations
- 90%+ precision + minimal compute required
- Enables efficient filtering of 43M+ documents
huggingface.co/davanstrien/...
This is a particularly bad case-study in how badly AI summarization can go when its exposed to the wilds of the internet - posted some notes on my blog: simonwillison.net/2024/Dec/29/...
29.12.2024 01:32 โ ๐ 117 ๐ 23 ๐ฌ 11 ๐ 7Er selv stor fan af squidfunk.github.io/mkdocs-mater... ๐จโ๐ป
23.12.2024 19:50 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0๐ฅ Ending 2024: A full data annotation journey on the Hugging Face Hubโfrom raw data to training-ready datasets!
With Argilla 2.6.0, push your data to the Hub from the UI
Letโs make 2025 the year anyone can build more transparent and accountable AIโno coding or model skills needed.
The Phi-4 Technical Report briefly mentions the importance of the sequence in which the training data is fed to the model. Actually I think that determining the ideal sequence should be the next big research topic.
huggingface.co/papers/2412....
Angreb pรฅ Ultralytics via GitHub Actions og PyPI: blog.pypi.org/posts/2024-1... #dkdev
14.12.2024 16:25 โ ๐ 1 ๐ 1 ๐ฌ 0 ๐ 0Stรฆrkt! ๐ช
09.12.2024 18:41 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Overview of PixMo and its relation to Molmo's ability. PixMo's captions data enables Molmo's fine-grained understanding; PixMo's AskModelAnything enables Molmo's user interaction; PixMo's pointing data enables Molmo's pointing and counting; PixMo's synthetic data enables Molmo's visual skills.
Remember Molmo? The full recipe is finally out!
Training code, data, and everything you need to reproduce our models. Oh, and we have updated our tech report too!
Links in thread ๐
๐ฎ New YouTube video! ๐ฎ
We experimented with a log monitoring system. Spin up the agent and it'll monitor your logs for any potential issues -- it works with webserver logs like nginx or Apache, Linux system logs, etc.
youtu.be/csw6TVfzBcw
Look at this! ๐คฉ
@pydantic.bsky.social for AI Agents ๐ค๐
@moltke.bsky.social, tror du det er et uheld?
28.11.2024 18:30 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0FYI, here's the entire code to create a dataset of every single bsky message in real time:
```
from atproto import *
def f(m): print(m.header, parse_subscribe_repos_message())
FirehoseSubscribeReposClient().start(f)
```
The thing is, there's already a dataset of 235 MILLION posts from 4 MILLION users available for months. Not sure why @hf.co is a target of abuse
zenodo.org/records/1108...
Hvad er pรฅ vej? En ny evaluering til scandeval? ๐
26.11.2024 13:37 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Gad vide hvad AI-Sweden-Models/Llama-3-8B-instruct (few-shot) (med en rank pรฅ 1.35 og en top 4 placering) har gjort rigtigt? ๐ค
26.11.2024 13:13 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0*** New Model on ScandEval ***
New Danish LLM from the NLP North Lab, SnakModel, based on Llama-2-7b.
Danish results (lower is better):
- NLPnorth/snakmodel-7b-base: 3.60
- NLPnorth/snakmodel-7b-instruct: 2.59
For reference, Llama-2-7b achieves 3.08.
Leaderboards: scandeval.com
#dkai #nlp
Vi er gรฅet all in pรฅ uv i MediaCatch og det har givet nogle gevaldige speed ups i GitHub workflows og docker builds. ๐ฅ
Man skal dog vรฆre opmรฆrksom pรฅ cache, da den kan bruge en del plads. Sรฅ uv cache prune en gang imellem. ๐
๐ฑ Jeg har lavet et feed, der samler dansk tech-indhold via hashtaggene #dkai, #dkdev og #dktech! Fรธlg med for at holde dig opdateret med det danske tech-community ๐ฉ๐ฐ
Prรธv det her: bsky.app/profile/did:...