Mathias Nielsen

@mathiasesn1.bsky.social

🏢 Senior Machine Learning Engineer @ https://mediacatch.io/ 💻 https://grandaiwizard.com/ 👨‍💻 https://github.com/mathiasesn 🔗 https://www.linkedin.com/in/mathias-emil-slettemark-nielsen-bab596150/ 🤗 https://huggingface.co/mathiasn1

93 Followers  |  105 Following  |  7 Posts  |  Joined: 21.11.2024  |  1.7611

Latest posts by mathiasesn1.bsky.social on Bluesky

we heard you hate writing boilerplate code

so we built something...

> open gradio sketch
> select and add components
> configure visually
> get perfect python code 🤯

Building AI apps will never be the same 🔥
Coming very soon 👀

19.02.2025 10:12 | 👍 4    🔁 2    💬 1    📌 0

[dk] OK, this is starting to get stupid..

02.02.2025 16:58 | 👍 26    🔁 2    💬 2    📌 0

We’re building a new static type checker for Python, from scratch, in Rust.

From a technical perspective, it’s probably our most ambitious project yet. We’re about 800 PRs deep!

29.01.2025 17:18 | 👍 734    🔁 104    💬 35    📌 36

Vectorsearch Hub Datasets - a Hugging Face Space by davidberenstein1957
Add vectors to Hub datasets and do in-memory vector search.

🤯 Vector search on top of millions of docs in seconds. No pre-indexing!

Model2Vec is an embedding powerhouse that distils good models and makes them up to 500x faster and 15x smaller.

Vector Search on Hub Datasets demo: https://buff.ly/4gYhVlY
Library: https://buff.ly/42miwte

24.01.2025 13:00 | 👍 5    🔁 2    💬 1    📌 0
The image shows an illustration titled "Hygge Web Data" featuring three cartoon animals - a fox, an owl, and what appears to be a bear or similar animal - sitting at a table or surface reviewing various documents and papers. The style is cute and whimsical, with the animals drawn in a simple, friendly manner. Each animal is looking at different papers with sketched symbols, text, and designs on them. The illustration has a gentle, cozy feel to it, fitting with the "hygge" (Danish concept of coziness and comfort) mentioned in the title.

Introducing Scandi-fine-web-cleaner, a decoder model trained to remove low-quality web content from FineWeb 2 for Danish and Swedish

- Uses FineWeb-c community annotations
- 90%+ precision + minimal compute required
- Enables efficient filtering of 43M+ documents

huggingface.co/davanstrien/...

13.01.2025 15:48 | 👍 17    🔁 4    💬 1    📌 1

This is a particularly bad case study in how badly AI summarization can go when it's exposed to the wilds of the internet - posted some notes on my blog: simonwillison.net/2024/Dec/29/...

29.12.2024 01:32 | 👍 117    🔁 23    💬 11    📌 7

Material for MkDocs
Write your documentation in Markdown and create a professional static site in minutes – searchable, customizable, in 60+ languages, for all devices

I'm a big fan of squidfunk.github.io/mkdocs-mater... myself 👨‍💻

23.12.2024 19:50 | 👍 1    🔁 0    💬 1    📌 0

💥 Ending 2024: A full data annotation journey on the Hugging Face Hub, from raw data to training-ready datasets!

With Argilla 2.6.0, push your data to the Hub from the UI

Let’s make 2025 the year anyone can build more transparent and accountable AI, with no coding or model skills needed.

20.12.2024 11:14 | 👍 20    🔁 3    💬 1    📌 0

Paper page - Phi-4 Technical Report
Join the discussion on this paper page

The Phi-4 Technical Report briefly mentions the importance of the sequence in which the training data is fed to the model. Actually I think that determining the ideal sequence should be the next big research topic.

huggingface.co/papers/2412....

14.12.2024 18:32 | 👍 1    🔁 1    💬 1    📌 0

Supply-chain attack analysis: Ultralytics - The Python Package Index Blog
Analysis of a package targeted by a supply-chain attack to the build and release process

Attack on Ultralytics via GitHub Actions and PyPI: blog.pypi.org/posts/2024-1... #dkdev

14.12.2024 16:25 | 👍 1    🔁 1    💬 0    📌 0

Post image

09.12.2024 04:28 | 👍 3    🔁 2    💬 0    📌 0

Impressive! 💪

09.12.2024 18:41 | 👍 1    🔁 0    💬 0    📌 0
Overview of PixMo and its relation to Molmo's ability. PixMo's captions data enables Molmo's fine-grained understanding; PixMo's AskModelAnything enables Molmo's user interaction; PixMo's pointing data enables Molmo's pointing and counting; PixMo's synthetic data enables Molmo's visual skills.

Remember Molmo? The full recipe is finally out!

Training code, data, and everything you need to reproduce our models. Oh, and we have updated our tech report too!

Links in thread 👇

09.12.2024 18:33 | 👍 78    🔁 14    💬 1    📌 1

Creating a Structured AI Log Analysis System with Python & LLMs
YouTube video by dottxt

New YouTube video!

We experimented with a log monitoring system. Spin up the agent and it'll monitor your logs for any potential issues -- it works with webserver logs like nginx or Apache, Linux system logs, etc.

youtu.be/csw6TVfzBcw

05.12.2024 17:08 | 👍 7    🔁 3    💬 2    📌 1

Look at this! 🤩

@pydantic.bsky.social for AI Agents 🤖🚀

02.12.2024 11:33 | 👍 17    🔁 3    💬 0    📌 0

@moltke.bsky.social, do you think it's an accident?

28.11.2024 18:30 | 👍 0    🔁 0    💬 0    📌 0

FYI, here's the entire code to create a dataset of every single bsky message in real time:

```
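from atproto import FirehoseSubscribeReposClient, parse_subscribe_repos_message

def f(m):
    # Print each firehose frame's header alongside its parsed message.
    print(m.header, parse_subscribe_repos_message(m))

FirehoseSubscribeReposClient().start(f)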
```

28.11.2024 09:56 | 👍 443    🔁 62    💬 20    📌 10

The thing is, there's already a dataset of 235 MILLION posts from 4 MILLION users available for months. Not sure why @hf.co is a target of abuse

zenodo.org/records/1108...

28.11.2024 01:32 | 👍 116    🔁 13    💬 7    📌 0

What's on the way? A new evaluation for scandeval? 😀

26.11.2024 13:37 | 👍 1    🔁 0    💬 1    📌 0

I wonder what AI-Sweden-Models/Llama-3-8B-instruct (few-shot) (with a rank of 1.35 and a top-4 placement) has done right? 🤔

26.11.2024 13:13 | 👍 1    🔁 0    💬 1    📌 0

NLPnorth/snakmodel-7b-instruct · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.

*** New Model on ScandEval ***

New Danish LLM from the NLP North Lab, SnakModel, based on Llama-2-7b.

Danish results (lower is better):
- NLPnorth/snakmodel-7b-base: 3.60
- NLPnorth/snakmodel-7b-instruct: 2.59

For reference, Llama-2-7b achieves 3.08.

Leaderboards: scandeval.com

#dkai #nlp

26.11.2024 12:21 | 👍 9    🔁 1    💬 2    📌 1

We've gone all in on uv at MediaCatch, and it has given us some massive speedups in GitHub workflows and Docker builds. 🔥
You do need to keep an eye on the cache, though, since it can take up quite a bit of space. So run uv cache prune once in a while. 😅

24.11.2024 19:09 | 👍 1    🔁 0    💬 1    📌 0

📱 I've made a feed that gathers Danish tech content via the hashtags #dkai, #dkdev, and #dktech! Follow along to stay up to date with the Danish tech community 🇩🇰

Try it here: bsky.app/profile/did:...

23.11.2024 14:53 | 👍 10    🔁 3    💬 0    📌 0

@mathiasesn1 is following 20 prominent accounts