Lien's Avatar

Lien

@liendata.bsky.social

Data. Semiconductor. Process mining. Homemade memes. Occasional artist. Skills of a data engineer and education of a data scientist... but I enjoy working as an overqualified data analyst instead πŸ¦„

482 Followers  |  862 Following  |  218 Posts  |  Joined: 13.11.2024  |  2.123

Latest posts by liendata.bsky.social on Bluesky

#objectcentric #processmining #pystackt #databs

05.08.2025 20:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ‘‘ Improve the docs. (I'm known as a queen of typos after all)

05.08.2025 20:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

🐍 Fix my code. (Apparently, it's not "pythonic" enough... or so I'm told)

05.08.2025 20:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

🎁 Contribute a data exporter to your favorite OCED format, or a data extractor for an open data source you're excited to dive into.

05.08.2025 20:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ“£ Shamelessly promote your compatible & open-source tool, method, use-case, ... by dropping a link in our docs.

05.08.2025 20:41 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - LienBosmans/pystackt: Data preparation for object-centric process mining Data preparation for object-centric process mining - LienBosmans/pystackt

πŸ“’ PyStack't is now open for contributions!

(To be honest, it always was, but now there is a contributing guide to make it more obvious and to help you get started.)

So, for anyone out there working on object-centric process mining, you're invited to 🧡

github.com/LienBosmans/...

05.08.2025 20:41 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Screenshot of draw a fish app after you let the fish swim. My fish is orange/red and has a cute black eye with a white highlight.

Screenshot of draw a fish app after you let the fish swim. My fish is orange/red and has a cute black eye with a white highlight.

Blub 🫧

25.07.2025 17:27 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Grey-banded Mining Bee. (Kruiskruidzandbij in Dutch.) Little bee sitting on a yellow flower.

Grey-banded Mining Bee. (Kruiskruidzandbij in Dutch.) Little bee sitting on a yellow flower.

Marmalade Hoverfly. (Snorzweefvlieg in Dutch.) Hoverflu with yellow, white and black stripes and big reddish eyes sitting on a blue flower.

Marmalade Hoverfly. (Snorzweefvlieg in Dutch.) Hoverflu with yellow, white and black stripes and big reddish eyes sitting on a blue flower.

Empis livida. (Akkerdisteldansvlieg in Dutch.) Looks like a hybrid between a mosquito and a fly, sitting on a white flower.

Empis livida. (Akkerdisteldansvlieg in Dutch.) Looks like a hybrid between a mosquito and a fly, sitting on a white flower.

Common Carder Bumblebee. (Akkerhommel in Dutch). Close up of a bumblebee hanging upside down on small white flowers.

Common Carder Bumblebee. (Akkerhommel in Dutch). Close up of a bumblebee hanging upside down on small white flowers.

This app allows me to relive my insect nerd childhood days and I'm loving it 😍

play.google.com/store/apps/d...

05.07.2025 15:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

"The translation from a business-oriented request into a data-oriented request should be on the data professional, not the stakeholder." and plenty more wisdom nuggets to be found here ⬇️

01.07.2025 18:12 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

If you're not sure how to interpret horizon plots, this gif is a nice, quick explainer: bernatgel.github.io/karyoploter_...

01.07.2025 10:01 β€” πŸ‘ 16    πŸ” 2    πŸ’¬ 2    πŸ“Œ 0

#processmining #OCPM #pystackt

26.06.2025 07:36 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸͺΆ Even better documentation! "The docs maketh the tool" - "Write for the user base you want, not the user base you have" - "Fine docs make a fine tool" ..or something like that.

26.06.2025 07:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ§ͺ Experimental support for PromG! Getting the data in an event knowledge graph just got easier... Not easy breazy lemon squeezy, lots of room for improvement in future releases, hence that "experimental" over there, but definitely easier than in the previous version, when there was nothing for EKGs.

26.06.2025 07:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

😧 No more disappointment in the morning when your overnight GitHub log extraction failed because of GitHub API outages, forced system restarts or obscure errors... Thank you improved error handling and intermediate save functionality!

26.06.2025 07:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - LienBosmans/pystackt: Data preparation for object-centric process mining Data preparation for object-centric process mining - LienBosmans/pystackt

New PyStack't release! This one is called v0.1.0 and includes 🧡

github.com/LienBosmans/...

#databs

26.06.2025 07:34 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Four panel comic about 2 blue aliens on a science fair. First panel: Young being standing next to a poster says "I have attempted science" and teacher asks "Please explain". Second panel: Young being replies "I formed an idea and then discovered I was wrong". Third panel: teacher looks closely at the poster and remarks "there are numerous diagrams" to which young being replies "I was wrong in numerous ways". Fourth panel: Young being says "I produced a detailed tribute to my wrongness" to which teacher replies "That is science".

Four panel comic about 2 blue aliens on a science fair. First panel: Young being standing next to a poster says "I have attempted science" and teacher asks "Please explain". Second panel: Young being replies "I formed an idea and then discovered I was wrong". Third panel: teacher looks closely at the poster and remarks "there are numerous diagrams" to which young being replies "I was wrong in numerous ways". Fourth panel: Young being says "I produced a detailed tribute to my wrongness" to which teacher replies "That is science".

You can always use this comic from Nathan W. Pyle? πŸ˜„

25.06.2025 17:39 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Behaag je tuin | Home

In my neighborhood you can order local plants once a year, which we did for some of the hedges and shrubs, because it was cheap and easy πŸ˜„

I'll link it for anyone from Vlaams-Brabant, Belgium in case they are interested πŸ˜‰

www.behaagjetuin.be

24.06.2025 20:03 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Our garden is a mixed bag of local plants (like shrubs and hedges) and a wildflower patch for pollinators, but also some plain grass for playing and some non-local plants I like.

24.06.2025 19:55 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It's not always clear which plants are native when you go to the plant store, and I don't want to do a lot of research.

Sometimes you also want to plant stuff of which you have fond (childhood) memories, and, if it's not invasive, I think that's a valid reason to choose some non-local plants too.

24.06.2025 19:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0
Preview
Tegenwoordige Tijd Pommelien Thijs Β· Tegenwoordige Tijd Β· Song Β· 2025
22.06.2025 19:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I agree that's not where you want to start a career in data engineering but I actually really liked starting out as a data team of one to figure out which parts of the job I liked best before moving into an analyst focused role πŸ˜„

20.06.2025 10:35 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
a drag queen is standing in front of an orange background and asking is it me ? am i the drama ? Alt: a drag queen is standing in front of an orange background and asking is it me ? am i the drama ?

It's funny how the answer to

"What is the problem with this function!?"

is almost always

"it's me, I am the problem"

#rstats #databs

20.06.2025 03:22 β€” πŸ‘ 17    πŸ” 3    πŸ’¬ 2    πŸ“Œ 0
19.06.2025 05:21 β€” πŸ‘ 16    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I built an open-source Python package to feed some data-starving researchers some real-life process data (activity from open-source GitHub repos) and, apart from a GitHub API outage, the testing is going surprisingly well.

18.06.2025 17:50 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Talking about a beautiful branch of mathematics :)

#mathematics #stem

18.06.2025 17:25 β€” πŸ‘ 28    πŸ” 5    πŸ’¬ 5    πŸ“Œ 1
Screenshot of terminal output.
18-06-2025 19:27    Extracting and mapping data for issue #17289 done ...66.5% (about 37679.9s remaining)
18-06-2025 19:27    Starting intermediate save process...
18-06-2025 19:27    Saving object-centric event data extracted from pandas-dev/pandas to DuckDB database file ./pandas_all_20250617.duckdb, schema stackt.
Overwriting schema 'stackt'
    Table object_types (4 records) done.
    Table object_attributes (15 records) done.
    Table objects (214020 records) done.
    Table object_attribute_values (643050 records) done.
    Table event_types (37 records) done.
    Table events (836399 records) done.
    Table event_attributes (37 records) done.
    Table event_attribute_values (251858 records) done.
    Table relation_qualifiers (7 records) done.
    Table event_to_object (1837081 records) done.
    Table object_to_object (50698 records) done.
    Table event_to_object_attribute_value (0 records) done.
18-06-2025 19:32    Extracting and mapping data for issue #17089 done ...66.8% (about 37327.3s remaining)
18-06-2025 19:36    Extracting and mapping data for issue #16941 done ...67.0% (about 37056.2s remaining)
18-06-2025 19:37    Extracting and mapping data for issue #16903 done ...67.1% (about 36988.2s remaining).

Screenshot of terminal output. 18-06-2025 19:27 Extracting and mapping data for issue #17289 done ...66.5% (about 37679.9s remaining) 18-06-2025 19:27 Starting intermediate save process... 18-06-2025 19:27 Saving object-centric event data extracted from pandas-dev/pandas to DuckDB database file ./pandas_all_20250617.duckdb, schema stackt. Overwriting schema 'stackt' Table object_types (4 records) done. Table object_attributes (15 records) done. Table objects (214020 records) done. Table object_attribute_values (643050 records) done. Table event_types (37 records) done. Table events (836399 records) done. Table event_attributes (37 records) done. Table event_attribute_values (251858 records) done. Table relation_qualifiers (7 records) done. Table event_to_object (1837081 records) done. Table object_to_object (50698 records) done. Table event_to_object_attribute_value (0 records) done. 18-06-2025 19:32 Extracting and mapping data for issue #17089 done ...66.8% (about 37327.3s remaining) 18-06-2025 19:36 Extracting and mapping data for issue #16941 done ...67.0% (about 37056.2s remaining) 18-06-2025 19:37 Extracting and mapping data for issue #16903 done ...67.1% (about 36988.2s remaining).

Getting closer...

18.06.2025 17:40 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Terminal output.
Start extraction of object-centric event data from all issues of repository pandas-dev/pandas using GitHub REST API via PyGitHub library.
Status updates will be displayed every 1% progress and every 5 minutes. While you wait, you can read about GitHub API rate limits here: https://docs.github.com/en/rest/using-the-rest-api/rate-limits-for-the-rest-api
17-06-2025 22:42    Starting data extraction for approximately 61672 issues ...
17-06-2025 22:42    Extracting and mapping data for issue #61672 done ...0.0% (about 78035.3s remaining)
17-06-2025 22:47    Extracting and mapping data for issue #61313 done ...0.4% (about 83959.7s remaining)
17-06-2025 22:52    Extracting and mapping data for issue #61045 done ...0.7% (about 85166.4s remaining)
17-06-2025 22:54    ⚠️ Skipping issue #60972 due to GitHub error: 404 {"message": "Not Found", "documentation_url": "https://docs.github.com/rest", "status": "404"}
17-06-2025 22:54    Starting intermediate save process...
17-06-2025 22:54    Saving object-centric event data extracted from pandas-dev/pandas to DuckDB database file ./pandas_all_20250617.duckdb, schema stackt.
Overwriting schema 'stackt'
    Table object_types (4 records) done.
    Table object_attributes (15 records) done.
    Table objects (3012 records) done.
    Table object_attribute_values (8934 records) done.
    Table event_types (30 records) done.
    Table events (8423 records) done.
    Table event_attributes (30 records) done.
    Table event_attribute_values (2273 records) done.
    Table relation_qualifiers (7 records) done.
    Table event_to_object (19064 records) done.
    Table object_to_object (715 records) done.
    Table event_to_object_attribute_value (0 records) done.
17-06-2025 22:56    Extracting and mapping data for issue #60827 done ...1.0% (about 85049.9s remaining)
17-06-2025 22:57    Extracting and mapping data for issue #60791 done ...1.1% (about 84820.6s remaining).

Terminal output. Start extraction of object-centric event data from all issues of repository pandas-dev/pandas using GitHub REST API via PyGitHub library. Status updates will be displayed every 1% progress and every 5 minutes. While you wait, you can read about GitHub API rate limits here: https://docs.github.com/en/rest/using-the-rest-api/rate-limits-for-the-rest-api 17-06-2025 22:42 Starting data extraction for approximately 61672 issues ... 17-06-2025 22:42 Extracting and mapping data for issue #61672 done ...0.0% (about 78035.3s remaining) 17-06-2025 22:47 Extracting and mapping data for issue #61313 done ...0.4% (about 83959.7s remaining) 17-06-2025 22:52 Extracting and mapping data for issue #61045 done ...0.7% (about 85166.4s remaining) 17-06-2025 22:54 ⚠️ Skipping issue #60972 due to GitHub error: 404 {"message": "Not Found", "documentation_url": "https://docs.github.com/rest", "status": "404"} 17-06-2025 22:54 Starting intermediate save process... 17-06-2025 22:54 Saving object-centric event data extracted from pandas-dev/pandas to DuckDB database file ./pandas_all_20250617.duckdb, schema stackt. Overwriting schema 'stackt' Table object_types (4 records) done. Table object_attributes (15 records) done. Table objects (3012 records) done. Table object_attribute_values (8934 records) done. Table event_types (30 records) done. Table events (8423 records) done. Table event_attributes (30 records) done. Table event_attribute_values (2273 records) done. Table relation_qualifiers (7 records) done. Table event_to_object (19064 records) done. Table object_to_object (715 records) done. Table event_to_object_attribute_value (0 records) done. 17-06-2025 22:56 Extracting and mapping data for issue #60827 done ...1.0% (about 85049.9s remaining) 17-06-2025 22:57 Extracting and mapping data for issue #60791 done ...1.1% (about 84820.6s remaining).

And we're back in business πŸ˜„

17.06.2025 20:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I'll take it as a sign to finally add that feature for incremental saves

17.06.2025 20:09 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

It was not me to be clear, I'm nicely adhering to the API rate limits of 5000 requests an hour #databs

17.06.2025 20:08 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Screenshot from githubstatus.com, showing Major Outage for API Requests.

Screenshot from githubstatus.com, showing Major Outage for API Requests.

When you're 24 hours into an +- 30 hours stress-test data extraction and the GitHub API goes down, so it fails with a MaxRetryError 😭

17.06.2025 20:06 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@liendata is following 20 prominent accounts