There have been times that I was using an agent to add features to my own OS library, where it referred me to use... my own open source library.
01.03.2026 04:46 — 👍 2 🔁 0 💬 0 📌 0@gross.systems.bsky.social
Engineer at YipitData. NYC Area https://github.com/andrewgross/ https://gross.systems I was told I had to add AI Engineer to my profile for the bots to find me. Views my own, not my employer etc etc.
There have been times that I was using an agent to add features to my own OS library, where it referred me to use... my own open source library.
01.03.2026 04:46 — 👍 2 🔁 0 💬 0 📌 0I can see companies wanting to expose a way for your agent to hook into their systems/data, but with a better interface than just some query APIs. They would have their own prompting, skills, agents etc, but would be "prompted" by your local agent. Maybe the A2A protocol has something like this.
26.02.2026 02:22 — 👍 0 🔁 0 💬 0 📌 0Has anyone seen support for things like "remote skills" for Claude Code or similar? Its a bit different from MCP in that its not just calling a regular API endpoint. Im thinking something closer to how the Web Search tool works, where its really a mini remote agentic system returning results.
26.02.2026 02:20 — 👍 0 🔁 0 💬 1 📌 0Looks like at least 10 days ago www.youtube.com/watch?v=l3O4...
20.02.2026 02:14 — 👍 1 🔁 0 💬 0 📌 0Picture of the auto memory description text from https://code.claude.com/docs/en/memory
Picture of a claude code session where the agent decides to write to the MEMORY.md file for the project.
When did Claude Code auto memory start getting rolled out, cool as hell
20.02.2026 02:07 — 👍 0 🔁 0 💬 1 📌 0Have you been following model performance on SWE-Rebench to attempt to identify contamination? swe-rebench.com
19.02.2026 14:58 — 👍 3 🔁 0 💬 1 📌 0When agents have metrics, output tests, and input data, they can iterate like crazy. Being able to generate this loop this approach for any problem would be a huge timesaver and remove a lot of mental effort.
07.02.2026 18:39 — 👍 1 🔁 0 💬 0 📌 0For the next big jump for coding agents, I think the frontier labs are going to spend a lot of effort on making them good and creating their own problem specific harness for iterating.
07.02.2026 18:39 — 👍 0 🔁 0 💬 1 📌 0I want to make the concept of cells, output, and state in an ipykernel legible to them as well for the same reason. Much easier to collaborate when the end goal isnt just a top to bottom script every time.
07.02.2026 16:31 — 👍 1 🔁 0 💬 0 📌 0I have a sneaking suspicion time travel debuggers are going to come back in to vogue once they are made legible to agents.
07.02.2026 16:30 — 👍 1 🔁 0 💬 1 📌 0Maybe the harness needs to start with setting a target iteration speed and using it as an optimization metric.
07.02.2026 16:29 — 👍 0 🔁 0 💬 0 📌 0Getting agents to run experiments where you need to trade off feature optimization with running time optimization is tough. Agents don't experience linear time and don't have a "feeling" that they need to spend time optimizing to iterate faster.
07.02.2026 16:28 — 👍 0 🔁 0 💬 1 📌 0There's no reason the "Movie" Toothbrushing song on the Yoto has to go that hard.
01.02.2026 19:47 — 👍 2 🔁 0 💬 0 📌 0One of those cases where it would have been astounding to me if someone hadn't already investigated these problems deeply, I just didn't know how to find it.
29.01.2026 02:27 — 👍 0 🔁 0 💬 0 📌 0I was doing a bit of work with taking disjoin subgraphs and wanting to separate some of the larger ones based on overlapping connections and got introduced to the Louvain method and bridging.
29.01.2026 02:27 — 👍 0 🔁 0 💬 1 📌 0LLMs can be great for those cases (in coding at least) where you assume there is a body of work around a problem, but you don't know the terminology to find it. In the past you just had to Google and hope, or ask a coworker. Still possible to fail with LLMs but can be easier.
29.01.2026 02:24 — 👍 0 🔁 0 💬 1 📌 0Github Stars are a very salient example of Goodhart's law right now.
27.01.2026 16:01 — 👍 0 🔁 0 💬 0 📌 0I feel like a time travel debugger that lets you rewind state while making edits etc could be super helpful for them, given the right interface. Makes it easier to take actions that are easily reversible.
22.01.2026 21:35 — 👍 0 🔁 0 💬 0 📌 0My kingdom for sparse mxfp4 kernels in FlashInfer for sm120
22.01.2026 15:00 — 👍 1 🔁 0 💬 0 📌 0CC behaves differently with Shift-Enter in the terminal vs as a Cursor extension. Its maddening.
13.01.2026 18:48 — 👍 1 🔁 0 💬 1 📌 0If if you let it iterate on local hardware before crafting Kaggle notebooks for submission, I bet a lot of the trick would be to get it to be able to run and monitor long async jobs, and make decisions about when to optimize training speed.
13.01.2026 03:57 — 👍 0 🔁 0 💬 0 📌 0I want to figure out a harness so that I can run Claude Code against a Kaggle competition. I feel like it could pretty reliably score silver in many competitions, assuming it was given access to the forums and had a decent harness
13.01.2026 03:57 — 👍 0 🔁 0 💬 1 📌 0ASCII/text flow chart with an arrow between nodes where the arrow character's shaft isnt quite aligned with the line character's height
New requirement for monospace fonts: When coding agents draw diagrams the arrow lines are aligned.
09.01.2026 03:13 — 👍 0 🔁 0 💬 0 📌 0I used to think that typing speed beyond a certainly level had diminishing returns for programming. That said, it seems like there is a lot of value in being able to quickly convey prompts via typing, as voice transcription lacks some capabilities for things like linking files or code.
08.01.2026 18:00 — 👍 2 🔁 0 💬 0 📌 0I remember a while back there was a lot of discourse around codebases that are "greppable" vs those that are expected to be navigated with an IDE. Seems like greppable ended up being the correct choice for coding agents (at leat for now).
07.01.2026 21:50 — 👍 2 🔁 0 💬 0 📌 0Trying out a new thing: "This code dictated but not read" to absolve myself of all responsibility for LLM generated code, just like executives in the 60s would place all the blame for their bad ideas on their secretary.
12.12.2025 03:57 — 👍 2 🔁 0 💬 0 📌 0
Release PySpark Toolkit 0.5.0:
* support for S3 Presigned URLs in Pure Pyspark
* `fdtf` - A generic UDTF decorator that can wrap an arbitrary row level python function and append its results as columns to your dataframe. Cuz UDTFs are annoying to use.
One thing I noticed with a lot of AI blog spam is that they never seem to properly link things. I just went over an entire article about a tool I wanted to check out, and not once did it actually link to the tool.
28.11.2025 12:22 — 👍 0 🔁 0 💬 0 📌 0