#objectcentric #processmining #pystackt #databs
05.08.2025 20:42 β π 0 π 0 π¬ 0 π 0@liendata.bsky.social
Data. Semiconductor. Process mining. Homemade memes. Occasional artist. Skills of a data engineer and education of a data scientist... but I enjoy working as an overqualified data analyst instead π¦
#objectcentric #processmining #pystackt #databs
05.08.2025 20:42 β π 0 π 0 π¬ 0 π 0π Improve the docs. (I'm known as a queen of typos after all)
05.08.2025 20:42 β π 0 π 0 π¬ 1 π 0π Fix my code. (Apparently, it's not "pythonic" enough... or so I'm told)
05.08.2025 20:42 β π 0 π 0 π¬ 1 π 0π Contribute a data exporter to your favorite OCED format, or a data extractor for an open data source you're excited to dive into.
05.08.2025 20:42 β π 0 π 0 π¬ 1 π 0π£ Shamelessly promote your compatible & open-source tool, method, use-case, ... by dropping a link in our docs.
05.08.2025 20:41 β π 0 π 0 π¬ 1 π 0π’ PyStack't is now open for contributions!
(To be honest, it always was, but now there is a contributing guide to make it more obvious and to help you get started.)
So, for anyone out there working on object-centric process mining, you're invited to π§΅
github.com/LienBosmans/...
Screenshot of draw a fish app after you let the fish swim. My fish is orange/red and has a cute black eye with a white highlight.
Blub π«§
25.07.2025 17:27 β π 3 π 0 π¬ 0 π 0Grey-banded Mining Bee. (Kruiskruidzandbij in Dutch.) Little bee sitting on a yellow flower.
Marmalade Hoverfly. (Snorzweefvlieg in Dutch.) Hoverflu with yellow, white and black stripes and big reddish eyes sitting on a blue flower.
Empis livida. (Akkerdisteldansvlieg in Dutch.) Looks like a hybrid between a mosquito and a fly, sitting on a white flower.
Common Carder Bumblebee. (Akkerhommel in Dutch). Close up of a bumblebee hanging upside down on small white flowers.
This app allows me to relive my insect nerd childhood days and I'm loving it π
play.google.com/store/apps/d...
"The translation from a business-oriented request into a data-oriented request should be on the data professional, not the stakeholder." and plenty more wisdom nuggets to be found here β¬οΈ
01.07.2025 18:12 β π 2 π 1 π¬ 0 π 0If you're not sure how to interpret horizon plots, this gif is a nice, quick explainer: bernatgel.github.io/karyoploter_...
01.07.2025 10:01 β π 16 π 2 π¬ 2 π 0#processmining #OCPM #pystackt
26.06.2025 07:36 β π 0 π 0 π¬ 0 π 0πͺΆ Even better documentation! "The docs maketh the tool" - "Write for the user base you want, not the user base you have" - "Fine docs make a fine tool" ..or something like that.
26.06.2025 07:35 β π 0 π 0 π¬ 1 π 0π§ͺ Experimental support for PromG! Getting the data in an event knowledge graph just got easier... Not easy breazy lemon squeezy, lots of room for improvement in future releases, hence that "experimental" over there, but definitely easier than in the previous version, when there was nothing for EKGs.
26.06.2025 07:35 β π 0 π 0 π¬ 1 π 0π§ No more disappointment in the morning when your overnight GitHub log extraction failed because of GitHub API outages, forced system restarts or obscure errors... Thank you improved error handling and intermediate save functionality!
26.06.2025 07:34 β π 0 π 0 π¬ 1 π 0New PyStack't release! This one is called v0.1.0 and includes π§΅
github.com/LienBosmans/...
#databs
Four panel comic about 2 blue aliens on a science fair. First panel: Young being standing next to a poster says "I have attempted science" and teacher asks "Please explain". Second panel: Young being replies "I formed an idea and then discovered I was wrong". Third panel: teacher looks closely at the poster and remarks "there are numerous diagrams" to which young being replies "I was wrong in numerous ways". Fourth panel: Young being says "I produced a detailed tribute to my wrongness" to which teacher replies "That is science".
You can always use this comic from Nathan W. Pyle? π
25.06.2025 17:39 β π 3 π 0 π¬ 1 π 0In my neighborhood you can order local plants once a year, which we did for some of the hedges and shrubs, because it was cheap and easy π
I'll link it for anyone from Vlaams-Brabant, Belgium in case they are interested π
www.behaagjetuin.be
Our garden is a mixed bag of local plants (like shrubs and hedges) and a wildflower patch for pollinators, but also some plain grass for playing and some non-local plants I like.
24.06.2025 19:55 β π 2 π 0 π¬ 0 π 0It's not always clear which plants are native when you go to the plant store, and I don't want to do a lot of research.
Sometimes you also want to plant stuff of which you have fond (childhood) memories, and, if it's not invasive, I think that's a valid reason to choose some non-local plants too.
I agree that's not where you want to start a career in data engineering but I actually really liked starting out as a data team of one to figure out which parts of the job I liked best before moving into an analyst focused role π
20.06.2025 10:35 β π 2 π 0 π¬ 0 π 0It's funny how the answer to
"What is the problem with this function!?"
is almost always
"it's me, I am the problem"
#rstats #databs
I built an open-source Python package to feed some data-starving researchers some real-life process data (activity from open-source GitHub repos) and, apart from a GitHub API outage, the testing is going surprisingly well.
18.06.2025 17:50 β π 0 π 0 π¬ 0 π 0Talking about a beautiful branch of mathematics :)
#mathematics #stem
Screenshot of terminal output. 18-06-2025 19:27 Extracting and mapping data for issue #17289 done ...66.5% (about 37679.9s remaining) 18-06-2025 19:27 Starting intermediate save process... 18-06-2025 19:27 Saving object-centric event data extracted from pandas-dev/pandas to DuckDB database file ./pandas_all_20250617.duckdb, schema stackt. Overwriting schema 'stackt' Table object_types (4 records) done. Table object_attributes (15 records) done. Table objects (214020 records) done. Table object_attribute_values (643050 records) done. Table event_types (37 records) done. Table events (836399 records) done. Table event_attributes (37 records) done. Table event_attribute_values (251858 records) done. Table relation_qualifiers (7 records) done. Table event_to_object (1837081 records) done. Table object_to_object (50698 records) done. Table event_to_object_attribute_value (0 records) done. 18-06-2025 19:32 Extracting and mapping data for issue #17089 done ...66.8% (about 37327.3s remaining) 18-06-2025 19:36 Extracting and mapping data for issue #16941 done ...67.0% (about 37056.2s remaining) 18-06-2025 19:37 Extracting and mapping data for issue #16903 done ...67.1% (about 36988.2s remaining).
Getting closer...
18.06.2025 17:40 β π 0 π 0 π¬ 0 π 0Terminal output. Start extraction of object-centric event data from all issues of repository pandas-dev/pandas using GitHub REST API via PyGitHub library. Status updates will be displayed every 1% progress and every 5 minutes. While you wait, you can read about GitHub API rate limits here: https://docs.github.com/en/rest/using-the-rest-api/rate-limits-for-the-rest-api 17-06-2025 22:42 Starting data extraction for approximately 61672 issues ... 17-06-2025 22:42 Extracting and mapping data for issue #61672 done ...0.0% (about 78035.3s remaining) 17-06-2025 22:47 Extracting and mapping data for issue #61313 done ...0.4% (about 83959.7s remaining) 17-06-2025 22:52 Extracting and mapping data for issue #61045 done ...0.7% (about 85166.4s remaining) 17-06-2025 22:54 β οΈ Skipping issue #60972 due to GitHub error: 404 {"message": "Not Found", "documentation_url": "https://docs.github.com/rest", "status": "404"} 17-06-2025 22:54 Starting intermediate save process... 17-06-2025 22:54 Saving object-centric event data extracted from pandas-dev/pandas to DuckDB database file ./pandas_all_20250617.duckdb, schema stackt. Overwriting schema 'stackt' Table object_types (4 records) done. Table object_attributes (15 records) done. Table objects (3012 records) done. Table object_attribute_values (8934 records) done. Table event_types (30 records) done. Table events (8423 records) done. Table event_attributes (30 records) done. Table event_attribute_values (2273 records) done. Table relation_qualifiers (7 records) done. Table event_to_object (19064 records) done. Table object_to_object (715 records) done. Table event_to_object_attribute_value (0 records) done. 17-06-2025 22:56 Extracting and mapping data for issue #60827 done ...1.0% (about 85049.9s remaining) 17-06-2025 22:57 Extracting and mapping data for issue #60791 done ...1.1% (about 84820.6s remaining).
And we're back in business π
17.06.2025 20:59 β π 0 π 0 π¬ 1 π 0I'll take it as a sign to finally add that feature for incremental saves
17.06.2025 20:09 β π 2 π 0 π¬ 1 π 0It was not me to be clear, I'm nicely adhering to the API rate limits of 5000 requests an hour #databs
17.06.2025 20:08 β π 1 π 0 π¬ 1 π 0Screenshot from githubstatus.com, showing Major Outage for API Requests.
When you're 24 hours into an +- 30 hours stress-test data extraction and the GitHub API goes down, so it fails with a MaxRetryError π
17.06.2025 20:06 β π 3 π 0 π¬ 1 π 0