Jason Kealey

@softwareengineering.ca.bsky.social

Exited startup founder giving it another whirl; software engineer. Particularly interested in productizing LLMs to simplify workflows.

355 Followers  |  563 Following  |  829 Posts  |  Joined: 24.11.2024  |  2.0161

Latest posts by softwareengineering.ca on Bluesky

Open to trying it out!

30.06.2025 14:26 - 👍 0    🔁 0    💬 1    📌 0

Vanna is good for the part where you load the schema plus docs into a vector DB for RAG; it's just the charts part that is weaker.

23.04.2025 11:06 - 👍 0    🔁 0    💬 1    📌 0

I want the graph part too, that’s where Vanna w/ plotly is failing. It’s not rendering when I use booleans or timestamps and it does scatter plots at inappropriate times

23.04.2025 11:05 - 👍 0    🔁 0    💬 1    📌 0
Vanna.AI - Personalized AI SQL Agent

Anyone got open source alternatives to vanna.ai for text to SQL?

The SQL part is pretty good but the plotly charts it recommends are wonky.

22.04.2025 23:21 - 👍 0    🔁 0    💬 2    📌 0

I’ve also been playing with dbt - and considered sqlmesh instead.

Chose to go deeper with dbt as I feel like sqlmesh’s real value shines when you want to avoid transforming the data both in dev and prod.

… and I’m just prototyping stuff locally to play with different open source BI frontends.

16.04.2025 11:01 - 👍 0    🔁 0    💬 0    📌 0

Last week I did some experimentation with unstructured.io to extract content out of some PDFs.

Also played with github.com/getomni-ai/z... as a more lightweight option.

Overall, vision models do a much better job than classic OCR (e.g., Tesseract) on tables in docs.

16.04.2025 10:57 - 👍 1    🔁 0    💬 0    📌 0
Review of Data Orchestration Landscape ... and honest one ...

Here’s a decent overview of the data pipeline orchestration tools on the market.

dataengineeringcentral.substack.com/p/review-of-...

16.04.2025 10:52 - 👍 0    🔁 0    💬 0    📌 0

Does this API exist in both the hosted and self-hosted versions?

When reading the docs last week I sometimes got mixed up about which features needed a subscription.

10.04.2025 20:02 - 👍 1    🔁 0    💬 0    📌 0

Nice! Hello!

Is the API giving you just the metadata of the metric or also translating it to the SQL you’d run to build charts like you do in Lightdash itself?

09.04.2025 19:58 - 👍 1    🔁 0    💬 1    📌 0

Lightdash may be a good UI option for me, but then I’m defining metrics at the presentation layer and tightly coupling them to it.

04.04.2025 21:25 - 👍 1    🔁 0    💬 1    📌 0

I’m looking for something to be able to express KPIs centrally and cleanly, and have the BI layer autogenerated from it.

The dbt semantic layer could be that, but it doesn’t seem like many open source BI layers support it.

04.04.2025 21:25 - 👍 0    🔁 0    💬 0    📌 0

I spent some time this week playing with some tools to set up a data pipeline for simple BI and data science.

Played with Airbyte, dbt, DuckDB and Metabase.

Planning on trying Lightdash next week.

Trying to avoid hosted data warehouses and use open source for the full chain.

04.04.2025 21:20 - 👍 8    🔁 0    💬 2    📌 0

Keep it up!

I’ve never felt motivated by hyping whatever I’m building to peers who’d never be clients/users.

26.03.2025 11:44 - 👍 0    🔁 0    💬 0    📌 0

I sold my solution by ignoring the sales rules and just flat out asking “how do you do <process>?” and replying we had an app for that when they outlined their manual process.

I felt dirty not outlining benefits at first, but I later realized this opener was better aligned with my ICP (operations)

26.03.2025 11:40 - 👍 1    🔁 0    💬 1    📌 0

My past experiment with a tax form was bad because the LLM used basic OCR instead of something fancier for tables.

And here I’m trying to do something more generic without knowing the form format ahead of time.

25.03.2025 22:35 - 👍 0    🔁 0    💬 0    📌 0

Imagine a government form with some weird tabular layout to shove as many fields into a condensed space as possible.

Generalized use case is read/write to forms. Basically reverse engineering a domain model from a form.

Azure DocIntel lets you do it well for a known form via training.

25.03.2025 22:34 - 👍 0    🔁 0    💬 1    📌 0

If making zip files named _final-website(2)-final-tuesday-3.zip is too complicated, maybe something like Dropbox with revision history baked in could work - at least for one file at a time.

25.03.2025 22:14 - 👍 2    🔁 0    💬 1    📌 0

That’s ironic but I guess totally expected since it’s their main revenue source hah

25.03.2025 11:37 - 👍 1    🔁 0    💬 0    📌 0

Interesting compensation model.

24.03.2025 12:17 - 👍 5    🔁 0    💬 0    📌 0

#chordle sounded kinda sus

24.03.2025 01:12 - 👍 1    🔁 0    💬 1    📌 0

#chloedle

24.03.2025 01:10 - 👍 2    🔁 1    💬 1    📌 0
New audio models from OpenAI, but how much can we rely on them? OpenAI announced several new audio-related API features today, for both text-to-speech and speech-to-text. They’re very promising new models, but they appear to suffer from the ever-present risk of ac...

I published some notes on OpenAI's new text-to-speech and speech-to-text models. They're promising, but like other LLM-driven multi-modal models they appear to suffer from the prompt-injection-adjacent problem of mixing instructions and data in the same token stream
simonwillison.net/2025/Mar/20/...

20.03.2025 20:41 - 👍 58    🔁 7    💬 3    📌 1

I like hearing “I used X library to do Y and it worked/failed”.

Helps broaden my perspective about what’s out there.

I dislike hearing “do like me and bathe in virgin blood to succeed”.

21.03.2025 11:02 - 👍 1    🔁 0    💬 1    📌 0

I don’t know. I’ve been deep in home renos last couple weeks. Haven’t been checking here much!

21.03.2025 00:10 - 👍 2    🔁 0    💬 1    📌 0

You’re right, she seems gone!

20.03.2025 23:45 - 👍 2    🔁 0    💬 1    📌 0

It’s sometimes easier to get something extra that doesn’t cost the founders more in the short term, such as an extra week of vacation or extra stock options.

19.03.2025 11:20 - 👍 3    🔁 0    💬 0    📌 0

My main lesson learned about economics is everyone is mostly wrong when predicting the future even if they have very logical reasons for their predictions.

Because predicting the future is hard.

But you still learn about the motivating factors behind people’s actions in the present.

19.03.2025 01:16 - 👍 1    🔁 0    💬 1    📌 0

Love that #econsky is here.

A few years ago, I started listening to podcasts about the economy during my runs and I feel like it massively helped me learn more about how the world works.

19.03.2025 01:09 - 👍 2    🔁 0    💬 1    📌 0

I should get into the penetration testing business because there will be so many security holes to find these coming years.

19.03.2025 01:05 - 👍 1    🔁 0    💬 0    📌 0
Salaries and benefits - Canada.ca Competitive salaries and a flexible benefits package.

It’s a bit worrying to see that, with the geopolitical climate heating up, CSIS (SCRS) pays its agents half or less of what private-sector positions pay www.canada.ca/fr/service-r...

19.03.2025 01:03 - 👍 1    🔁 0    💬 0    📌 0

@softwareengineering.ca is following 20 prominent accounts