Is there a write-up for this?
07.07.2025 17:38
@lopezgg.bsky.social
Catholic, Indian & Scientist MSFT: I help Phi understand longer context and fine-tune LLMs for domain-specific knowledge (background: yellow-rumped warbler from @carlbergstrom.com)
This article impressed me and gave me great hope for the papacy of Leo XIV.
22.05.2025 19:26
Also, the setup is designed so that the priest won't see our faces, and no one declares their name in a confession, so implementing the law is another difficulty.
09.05.2025 17:15
Not a citizen, so I can't comment much. It may be a good test, but is it ethical to experiment on a small group who hold their traditions dear? I just fear it might alienate people and bring more harm than good.
09.05.2025 17:00
Well, confession is short; it goes like "Father, I have sinned ..blah.. I did blah".
No relationship is built, and you don't see much of the priest's face. I doubt kids would feel comfortable confiding. There's little ROI in this law. I am a moderate Catholic, so take this with a grain of salt.
If you put this in law and confidentiality is broken, would the abusers confess?
09.05.2025 07:53
Crack for Latin nerds like me.
08.05.2025 22:50
Policy Gradients chapter of RLHF Book is MUCH improved after all the wonderful GRPO discussions in the last few weeks 🥰
(still open to bug reports)
If you've ever wanted to learn how the transformer architecture in general or latent multi-head attention works, here's an excellent visual explainer: www.youtube.com/watch?v=0VLA...
09.03.2025 17:46
A Catholic nun was the first U.S. woman to earn a Ph.D. in computer science. - History Facts historyfacts.com/science-indu...
01.03.2025 17:34
What are GGUF, Safetensors, PyTorch, and ONNX?
In this blog post, let's discover common formats for storing an AI model.
huggingface.co/blog/ngxson/...
If you ARE an AI, here's a free PDF: have at it www.probabilistic-numerics.org/textbooks/
23.02.2025 16:39
After 6+ months in the making and over a year of GPU compute, we're excited to release the "Ultra-Scale Playbook": hf.co/spaces/nanot...
A book to learn all about 5D parallelism, ZeRO, CUDA kernels, and how/why to overlap compute & comms, with theory, motivation, interactive plots and 4000+ experiments!
This excellent interactive tutorial on misleading data visualizations explores the idea of a "counter chart": the graph you draw to refute a misleading claim
flowingdata.com/projects/dis...
Best part of this that Luca isn't highlighting to start is that we trained a way better OLMoE for this too.
All from better annealing and post train. Didn't need to redo pre training. Goes to show how much potential these models have!
new instruct model: huggingface.co/allenai/OLMo...
"The Tears of Things" by Richard Rohr invites us to explore the wisdom of the Hebrew prophets.
He writes "Power distorts truth, so God plants and develops it at the edge, where the power-hungry least expect it," inviting us to the "edge of the inside." tinyurl.com/46z9574r
Tutorial on scaling LMs with JAX
jax-ml.github.io/scaling-book/
This paper is wild - a Stanford team shows the simplest way to make an open LLM into a reasoning model
They used just 1,000 carefully curated reasoning examples & a trick where if the model tries to stop thinking, they append "Wait" to force it to continue. Near o1 at math. arxiv.org/pdf/2501.19393
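A rough sketch of the "append Wait" trick described above, not the paper's actual code. Everything here is an assumption made for illustration: the `</think>` stop marker, the `generate_step` callable (a stand-in for a real LLM call), and the names `force_longer_thinking` and `make_toy_model`.

```python
from typing import Callable, Iterable

# Hypothetical marker the model emits when it wants to stop thinking.
END_OF_THINKING = "</think>"


def force_longer_thinking(
    generate_step: Callable[[str], str],
    prompt: str,
    max_extensions: int = 2,
    nudge: str = "Wait",
) -> str:
    """Keep the model reasoning by replacing early stop markers with a nudge.

    generate_step takes the text so far and returns the next chunk, which is
    assumed to end with END_OF_THINKING when the model tries to stop.
    """
    trace = prompt
    extensions = 0
    while True:
        chunk = generate_step(trace)
        wants_to_stop = chunk.endswith(END_OF_THINKING)
        if wants_to_stop and extensions < max_extensions:
            # Drop the stop marker and append the nudge so the model continues.
            trace += chunk[: -len(END_OF_THINKING)] + nudge
            extensions += 1
            continue
        # Either the extension budget is used up or the chunk had no marker.
        trace += chunk
        return trace


def make_toy_model(chunks: Iterable[str]) -> Callable[[str], str]:
    """Scripted stand-in for an LLM: returns the given chunks one at a time."""
    it = iter(chunks)

    def generate_step(context: str) -> str:
        return next(it)

    return generate_step
```

With a real model, `generate_step` would wrap whatever generation call you use, and `max_extensions` caps how many times the stop is overridden.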
o3-mini is really good at writing internal documentation - feed it a codebase, get back a detailed explanation of how specific aspects of it work https://simonwillison.net/2025/Feb/5/o3-mini-documentation/
05.02.2025 06:09
The main foundation-model-training companies spend a lot on curating their data these days. Whereas it used to be some simple quality filters, it's now a complex multi-stage pipeline. But yeah, no one usually shares statistical bias and variance analyses with their benchmarks.
29.01.2025 14:52
🤣
28.01.2025 14:41
Aaaaah good timing, published today!
"we introduce Mini Worldlit, a manually curated dataset of 1,192 works of contemporary fiction from 13 countries, representing nine languages"
By @andrewpiper.bsky.social, @dbamman.bsky.social, Christina Han, Jens Bjerring-Hansen, @hoytlong.bsky.social, et al.
If you want to quickly catch up on all the open modeling things (DeepSeek, ModernBERT, etc.), this was a great overview, by @natolambert.bsky.social.
I somehow got into an argument last week with someone who was insisting that all models are industrial blackboxes... and I wish I'd had this on hand.
Including a model with context length of 4M tokens!
17.01.2025 16:31
The blog post of the late Felix Hill is powerful. Stress for AI researchers today is real.
I did not know Felix Hill and I am sorry for those who did.
This story is perhaps a reminder for students, postdocs, founders and researchers to take care of their well-being.
medium.com/@felixhill/2...
Free course on Agents by Hugging Face. We just added a chapter to smol course on agents. Naturally, using smolagents! The course covers these topics:
- Code agents
- Retrieval agents
- Custom functional
If you're building agent applications, this course should help.
Visited the exposition of St. Francis Xavier
www.soultravelling.in/blog/know-al...
So, here you go
"For those who believe, no explanation is necessary. For those who do not believe, no explanation is possible."
Oh, and another quote I saw there that is very dear to me
"The outward adornment of the body should be a reflection of the inner virtue of the soul."
St. Thomas Aquinas' teachings in the Summa Theologica