Have we created a word yet for groups of agents that are constrained within nodes of a rigid workflow?
If you zoomed into a workflow node, you'd say: "oh yeah, this is agentic computing"
But if you zoom out, you'd say: "this is a classic workflow engine"
14.01.2025 15:24
"Jagged Frontier" is the term I've been looking for to describe the past two years of AI growth.
It's difficult from any one point on the frontier to make strong inferences about adjacent points.
E.g. AI can draft top-quality research memos, but still struggles with cocktail party chat.
10.01.2025 18:43
I (non-ironically) love that one of the top posts on HN right now is about making "beautiful" API keys..
..in which the first key in this pic is an example of "ugly", while the second key in this pic is an example of "beautiful"
10.01.2025 14:31
"The last book I wrote, I’m happy if humans read it, but I mostly wrote it for the AIs. And my next book I’m writing even more for the AIs." -Tyler Cowen
09.01.2025 20:04
The first big red "Destroy the LLM!" button will be because of national security fears, not evil AGI fears.
If a government trained an LLM on its intelligence materials, the model weights would be the most sensitive asset in its possession.
The opposite of compartmentalized information.
09.01.2025 16:19
What will the first AI Morris Worm be?
It's bound to happen..
20.12.2024 20:16
Informed shoulder shrugs in the small room often precede confident parrots in the big room.
14.12.2024 19:58
Totally. Also unless you’re using a very differently aligned LLM, the evaluator often suffers from the same judgement errors.
Eg preferring the same tropes or succumbing to the same reasoning errors
07.12.2024 23:17
I guess the ใคใชใฟ was a ใฃใชใฟ.
05.12.2024 19:52
We've got to get the LLMs back in the office -- they're barely working hard remotely!
05.12.2024 19:45
Strategies for agent automation are different when you're thinking in data SETS rather than POINTS.
Eg with an imperfect agent, you can:
- Maximize shots on goal, then detect which went in
- Maximize shot opportunities, then just take the best K
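A minimal sketch of the two set-level strategies above, under stated assumptions: the function names `shots_on_goal` and `best_k`, the `is_goal` verifier, the `score` function, and the toy agent are all hypothetical, and "take the best K" is read here as ranking opportunities before acting on them.

```python
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")


def shots_on_goal(
    attempt: Callable[[int], T],
    is_goal: Callable[[T], bool],
    n: int,
) -> List[T]:
    """Strategy 1: run an imperfect agent n times, then keep only the
    attempts that a separate detector accepts."""
    outputs = [attempt(i) for i in range(n)]
    return [out for out in outputs if is_goal(out)]


def best_k(
    opportunities: Iterable[T],
    score: Callable[[T], float],
    k: int,
) -> List[T]:
    """Strategy 2: gather many opportunities, rank them, and act on
    only the k most promising ones."""
    return sorted(opportunities, key=score, reverse=True)[:k]


# Toy stand-ins (assumptions): a deterministic "agent" and simple scoring.
def toy_agent(i: int) -> int:
    return i * 7 % 10


hits = shots_on_goal(toy_agent, is_goal=lambda x: x >= 8, n=10)
top = best_k(range(10), score=lambda x: -abs(x - 5), k=3)
```

Either way the agent never has to be right on any single point; the detector or the ranking does the filtering across the set.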
05.12.2024 18:33
Google has a ways to go...
ChatGPT is better at GSheets formula help than the in-app Gemini.
05.12.2024 17:28
That ten minute dance scene in Wicked captured the entire range of high school emotions better than anything that’s ever been filmed.
04.12.2024 01:54
26.11.2024 21:21
26.11.2024 21:11
26.11.2024 21:06
Hats off to the team that built Shopify's "Collaborators" feature.
Fine-grained permissions with OAuth is a tornado of pain even for developers.
Shopify nailed it -- and for non-developers no less!
26.11.2024 20:01
True alignment would be LLMs making us listen to a 15 minute story about weekends at grandma’s house before giving us the recipe for blueberry cobbler.
- The Recipe Blog Lobby
24.11.2024 19:25
I wonder how low-level you could take that.
Trying to blend OCR and LLM output is what prompted the tweet. Using each to smooth the flaws in the other.
Eg OCR breaks text stupidly and makes blurry-vision errors.
LLM fixes that.. but then goes overboard and hallucinates semantically.
24.11.2024 16:48
Teams working on OCR and Translation must be in such an odd spot right now...
LLMs don't yet universally outperform those task-specific models.. but it's pretty clear that they're on a path to.
.. So does Big Tech just freeze those products in place to wait?
24.11.2024 16:38
There's an interesting experiment waiting to be done w/ LLMs and OCR.
When OCRing a full page of text w/ an LLM, it can go off the rails and, when it does, it usually stays off the rails.
Feels like an interesting substrate for experiments that study hallucination.
24.11.2024 15:50
Wow this looks really slick
21.11.2024 23:27
AI coworkers will interact on software timescales (immediate results), but also human timescales (I'll let you know by end-of-day)
We've been doing "agent progress bar" experiments at @everpilotapp that let you know what's happening & also invite you to collab with the agent.
20.11.2024 17:13
Is this ticket transferable? :p
20.11.2024 15:41
Forming a patent troll company filled with ML engineers and designers feels like an oddly high ROI endeavor at this moment in time.
I hope that’s not happening right now..
20.11.2024 14:44
The starter packs and feeds are awesome.
I think the final step in empowering users would be tags on posts that work with feeds.
I could auth a 3rd party service to tag posts to me to up/down rank them in my own feed.
Keep $USER but not their rants about $TOPIC
20.11.2024 14:21
I will probably regret this, but..
Here is an agent that can negotiate prices, make package deals, and actually sell you candy online:
hawke.bot
Who wants to be the first human in history to buy something from an AI street hawker?
Blog post about it:
edwardbenson.com/2024/11/the-...
19.11.2024 16:13
Is there an easy button for porting over Twitter follows?
19.11.2024 03:07
Man of the internet; generalist.
App updates and snark.
🫡 Twitter sober since April '23.
#️⃣ Invented the hashtag.
Intellectual mutt, making AI useful. Former SRI International, APL, ISI Foundation, Stanford, Johns Hopkins. 4x founder. Decamped to Tahoe. Here to serve.
Professor, Stanford University
Just Giving: Why Philanthropy is Failing Democracy
System Error: Where Big Tech Went Wrong
I forecast water & climate risks. AMA
Senior TED Fellow, Truman Fellow, RAND Pardee fellow, Harvard adjunct🦫
SoCal-based, MN-born, 1st gen Punjabi
Water Canary Founder
Climate is the message but #water is the medium: go.ted.com/sonaarluthra
Respawning. Former CEO of HumanFirst (acquired by ICON plc). Curious about Irish poetry and life sciences/biotech. I like birds.
AI/ML Practitioner. Formerly VP of ML @ Tegus. Cameo, ShopRunner, Civis Analytics, Braintree/Venmo, Obama 2012. Views are all mine.
The Roots of Progress (rootsofprogress.org)
idk tbd \\ marseille \\ pizza \\ grouchy
www.vaughntan.org
Reporter at The Information
Serial fintech operator & investor. Prev: MD at M12 (Microsoft’s VC fund); ran Visa Ventures; early exec at VGS; Twitter: https://twitter.com/peter
Technologist, artist, and founder. CTO at Replicant AI. Originator of Ruse Hacker Collective, maître d'Pup's Pool Party, and producer at Sublimate NYC.
currently: investing in technology and films
creator: Breadwinner, Emoji Dick, 🦪
formerly: kickstarter, creative commons, y combinator
into: bread, data, surfing, literature, art
'not an artist per se' - The Guardian
Head of Responsible AI, CTO Office, Bloomberg.
Nerd. @mcandrew from the Other Place.
helping SMBs thrive w/ gen AI. owner @ midwestquality.consulting + board @ solvehungertoday.org | prev: partner @ IDEO.com ; SVP @ Newlab.com
Accountability Architect
Artisanal Database Maker
Storyteller/Writer/Producer
Avenger, Academic, Lawyer
Redistributive Justice
Choice Optimizer
Researcher & OG Prompt Engineer
Whistleblower
Human Palantir
https://linktr.ee/realdearsarah
post-normal person
harper.lol / reading.lol / photos.lol / harper.blog
@harper on the other place
I'm on Germ DM
https://ger.mx/A9bcnkcEv8ggK1BQeSIEBw3rJ6v1tZsJjeN1tA5NA7CU#did:plc:n6com3b6tkpq76vr5n7xqutu
Plays a mean game of Werewolf, also a VC who believes in worker power