Open to trying it out!
30.06.2025 14:26 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0@softwareengineering.ca.bsky.social
Exited startup founder giving it another whirl; software engineer. Particularly interested in productizing LLMs to simplify workflows.
Open to trying it out!
30.06.2025 14:26 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Vanna is good for the loading up the schema plus docs into a vector db for RAG part, just the charts part are weaker.
23.04.2025 11:06 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0I want the graph part too, thatโs where Vanna w/ plotly is failing. Itโs not rendering when I use booleans or timestamps and it does scatter plots at inappropriate times
23.04.2025 11:05 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Anyone got open source alternatives to vanna.ai for text to SQL?
The SQL part is pretty good but the plotly charts it recommends are wonky.
Iโve also been playing with dbt - and considered sqlmesh instead.
Chose to go deeper with dbt as I feel like sqlmeshโs real value shines when you want to avoid transforming the data both in dev and prod.
โฆ and Iโm just prototyping stuff locally to play with different open source BI frontends.
Last week I did some experimentation with unstructured.io to extract content out of some pdfs.
Also played with github.com/getomni-ai/z... as a more lightweight option.
Overall, vision models do a much better job than classic OCR (ex: tesseract) on tables in docs.
Hereโs a decent overview of the data pipeline orchestration tools on the market.
dataengineeringcentral.substack.com/p/review-of-...
Does this API exist in both the hosted and self-hosted versions?
When reading the docs last week I sometimes got mixed up in what features needed a subscription.
Nice! Hello!
Is the api giving you just the metadata of the metric or also translating to the SQL youโd run to build charts like you do in lightdash itself?
Lightdash may be a good UI option for me, but then Iโm defining metrics at the presentation layer and tightly coupled with it.
04.04.2025 21:25 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Iโm looking for something to be able to express KPIs centrally and cleanly, and have the BI layer autogenerated from it.
Dbt semantic layer could be that, but doesnโt seem like many open source BI layers support it.
I spent some time this week playing with some tools to setup a data pipeline for simple BI and data science.
Played with airbyte, dbt, duckdb and metabase.
Planning on trying lightdash next week.
Trying to avoid hosted data warehouses and use open source for the full chain.
Keep it up!
Iโve never felt motivated by hyping whatever Iโm building to peers whoโd never be clients/users.
I sold my solution by ignoring the sales rules and just flat out asking โhow do you do <process>?โ and replying we had an app for that when they outlined their manual process.
I felt dirty not outlining benefits at first, but I later realized this opener was better aligned with my ICP (operations)
My past experiment with a tax form was bad because the LLM used basic OCR instead of something fancier for tables.
And here Iโm trying to do something more generic without knowing the form format ahead of time.
Imagine a government form with some weird tabular layout to shove as many fields into a condensed space as possible.
Generalized use case is read/write to forms. Basically reverse engineering a domain model from a form.
Azure DocIntel lets you do it well for a known form via training.
If making zip files named _final-website(2)-final-tuesday-3.zip is too complicated maybe something like dropbox with revision history baked in could work - at least for one file at a time.
25.03.2025 22:14 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0Thatโs ironic but I guess totally expected since itโs their main revenue source hah
25.03.2025 11:37 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Interesting compensation model.
24.03.2025 12:17 โ ๐ 5 ๐ 0 ๐ฌ 0 ๐ 0#chordle sounded kinda sus
24.03.2025 01:12 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0#chloedle
24.03.2025 01:10 โ ๐ 2 ๐ 1 ๐ฌ 1 ๐ 0I published some notes on OpenAI's new text-to-speech and speech-to-text models. They're promising, but like other LLM-driven multi-modal models they appear to suffer from the prompt-injection-adjacent problem of mixing instructions and data in the same token stream
simonwillison.net/2025/Mar/20/...
I like hearing โI used X library to do Y and it worked/failedโ.
Helps broaden my perspective about whatโs out there.
I dislike hearing โdo like me and bathe in virgin blood to succeedโ.
I donโt know. Iโve been deep in home renos last couple weeks. Havenโt been checking here much!
21.03.2025 00:10 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0Youโre right, she seems gone!
20.03.2025 23:45 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0Itโs sometimes easier to get something extra that doesnโt cost the founders more in the short term, such as an extra week of vacation or extra stock options.
19.03.2025 11:20 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0My main lesson learned about economics is everyone is mostly wrong when predicting the future even if they have very logical reasons for their predictions.
Because predicting the future is hard.
But you still learn about the motivating factors behind peopleโs actions in the present.
Love that #econsky is here.
A few years ago, I started listening to podcasts about the economy during my runs and I feel like it massively helped me learn more about how the world works.
I should get into the penetration testing business because there will be so many security holes to find these coming years.
19.03.2025 01:05 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Cโest un peu inquiรฉtant de voir quโavec le climat gรฉopolitique qui se rรฉchauffe que le SCRS rรฉmunรจre ses agents la moitiรฉ ou moins que des postes au privรฉ www.canada.ca/fr/service-r...
19.03.2025 01:03 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0