Mary Holstege's Avatar

Mary Holstege

@mathling.mastodon.social.ap.brid.gy

Retired software engineer. Now I write programs to make art. Will natter on about tech in general with forays into linguistics, football, cooking, politics […] [bridged from https://mastodon.social/@mathling on the fediverse by https://fed.brid.gy/ ]

90 Followers  |  0 Following  |  452 Posts  |  Joined: 18.11.2024  |  1.2637

Latest posts by mathling.mastodon.social.ap.brid.gy on Bluesky

#balisage DW:

Pros:
Efficiency gains, accelerated development and knowledge transfer, great help system and conversion (but verify), ...

Cons:
Accuracy concerns, IP rights provenance, server energy consumption, ...

06.08.2025 19:35 — 👍 0    🔁 0    💬 0    📌 0

#balisage DW:

Example: Using CoPilot to help get familiarity with complex set of DITA customizations

06.08.2025 19:31 — 👍 0    🔁 0    💬 1    📌 0

#balisage DW:

The art of the prompt

Need to be succinct, specific, thorough, state the obvious, personae, break complex ideas down

06.08.2025 19:25 — 👍 0    🔁 0    💬 1    📌 0

#balisage DW:

How is AI useful to a publisher?

NOT for writing content! (IP, plagiarism, etc.)

But
poofreading
validation of facts and details
summarization
understanding
...

06.08.2025 19:22 — 👍 0    🔁 0    💬 1    📌 0
Balisage: How I Stopped Worrying & Learned to Love AI

#balisage

How I Stopped Worrying & Learned to Love AI (FP)—Dale Waldt

https://www.balisage.net/Proceedings/vol30/html/Waldt01/BalisageVol30-Waldt01.html

06.08.2025 19:14 — 👍 0    🔁 0    💬 1    📌 0

#balisage AF:

Neo4j graph database
BGE-M3 embedding model
FAISS vector indexer (vector search)
Qwen2.7B: the LLM

06.08.2025 19:09 — 👍 0    🔁 0    💬 0    📌 0

#balisage EB:

Designing a RAG with TEI

Taking TEI Guidelines p5.xml => digitai-p5.json + query scripts to create knowledge graph (for Neo4j) to instruct local LLM

06.08.2025 19:05 — 👍 0    🔁 0    💬 1    📌 0

#balisage AF:

DigitAI a tool to assist scholars and editors to applying TEI guidelines

Small "explainable AI"
Lightweight
Obedient
Multilingual
Interpretable
Modular
Open-Source

Ollama from Meta; missing key languages
Now Qwen2 models Qwen2:7b and Qwen2:7b-instruct

06.08.2025 19:02 — 👍 0    🔁 0    💬 1    📌 0

#balisage

Elisa Beshiro-Bondar and Alexander Fisher
Digital AI for Localized TEI/XML Assistance

06.08.2025 19:00 — 👍 0    🔁 0    💬 1    📌 0
Preview
GitHub - AndrewSales/XQS: XQuery implementation of Schematron XQuery implementation of Schematron. Contribute to AndrewSales/XQS development by creating an account on GitHub.

#balisage AS:

XQS implementation

XQuery for Schematron
Native XQuery implementation of ISO Schematron

XQS version 1.0 released
https://github.com/AndrewSales/XQS

06.08.2025 18:54 — 👍 0    🔁 0    💬 0    📌 0
What's New In Schematron 4

#balisage AS:

Now at Final Draft International Standard

Outline of changes
https://andrewsales.com/schematron4/index.html

06.08.2025 18:51 — 👍 0    🔁 0    💬 1    📌 0

#balisage Open Mike

Andrew Sales: Schematron 4 update and implementation

06.08.2025 18:50 — 👍 0    🔁 0    💬 1    📌 0

#balisage GB:

Also extension functions to call AI, therefore available in XSLT, XQuery, XProc, etc. etc.

06.08.2025 18:40 — 👍 0    🔁 0    💬 0    📌 0

#balisage GB:

DITA for AI prompts

Both XML and Markdown for topic formats; reuse/sharing

06.08.2025 18:37 — 👍 0    🔁 0    💬 1    📌 0

#balisage GB:

Oxygen AI Positron: AI plugin for Oxygen tools

06.08.2025 18:34 — 👍 0    🔁 0    💬 1    📌 0

#balisage Open Mike

George Bina: Structured Content and XML Technologies in the Context of AI Processing

06.08.2025 18:31 — 👍 0    🔁 0    💬 1    📌 0

#balisage DL:

JSON map as argument to call to JS function

Supplying XDM map in ixml:call doesn't work ("reasons")

SaxonJS 3 adds ixsl:json-parse() to construct JSON objects from XSL side

06.08.2025 18:23 — 👍 0    🔁 0    💬 0    📌 0

#balisage DL:

Document loading using doc() with request headers

'pool' option for HTTP requests; value
'xml' to add to documentPool so it can be accessed via doc()

06.08.2025 18:19 — 👍 0    🔁 0    💬 1    📌 0

#balisage DL:

Combining results of processing multiple source documents: new language features in SaxonJS 3 => cleaner solution

ixsl:promise
@Select using ixsl:all function to parallelize fetches
@on-completion called when done
@on-failure called on failure

06.08.2025 18:13 — 👍 0    🔁 0    💬 1    📌 0
Program | Declarative Amsterdam

#balisage DL:

SaxonJS Tutorial:
https://declarative.amsterdam/program-2021

06.08.2025 18:07 — 👍 0    🔁 0    💬 1    📌 0

#balisage

Sponsor Presentation: Saxonica -
SaxonJS 3 coding improvements—Debbie Lockett

06.08.2025 17:58 — 👍 0    🔁 0    💬 1    📌 0
Original post on mastodon.social

#balisage SD:
So... .what about programming languages?

Why? Turn off spell check, don't mess with punctuation, whitespace, syntax highlighting
Where? Same carrier as for NL: should be able to without conflict
How? Standard codes/names would be nice; little regional variation but many versions […]

06.08.2025 16:11 — 👍 0    🔁 0    💬 0    📌 0

#balisage SD:
Natural language identification

Why? Get right spelling, punctuation, font...
Where? xml:lang, Markdown ```, file systems extended attributes
How? Standard (ISO) values for language, country, script codes

06.08.2025 16:06 — 👍 0    🔁 0    💬 1    📌 0
Balisage: Language Identification for Program Code in Documents

#balisage

Language Identification for Program Code in Documents (LB)—Steven J. DeRose

https://www.balisage.net/Proceedings/vol30/html/DeRose02/BalisageVol30-DeRose02.html

06.08.2025 15:57 — 👍 0    🔁 0    💬 1    📌 0

#balisage HB:
Futures:
More parsers, integration of external tools
Plugin system for user-specific extension functions
Interactive visualization of expression values implemented as HTML pages

06.08.2025 15:27 — 👍 0    🔁 0    💬 0    📌 0

#balisage HB:

Definition document includes fall back parsing, e.g. try parse HTML as XML, then fall back to HTML, etc.

So now can navigate seamlessly into various file types

e.g. into CSS and navigate to properties

06.08.2025 15:23 — 👍 0    🔁 0    💬 1    📌 0

#balisage HB:

How to get seamlessness?

Need to map file URI => method of parsing

Based on file extension or pattern, folder name pattern (project-specific rules)

=> Infospace definition document

Define resource types
Grammars
Resource type uses (mapping)

06.08.2025 15:20 — 👍 0    🔁 0    💬 1    📌 0

#balisage HB:

Addressing limited file formats => Invisible XML

So add functions:
idoc() parse with ixml grammar
iexpand-nodes() expansion of text in tree into element nodes with ixml grammar
ivalid() validate strings against ixml grammar

06.08.2025 15:18 — 👍 0    🔁 0    💬 1    📌 0

#balisage HB:
Current limitations

1. Only subset of files can be parsed
2. Explicit call of parse functions distracts from pure navigation

06.08.2025 15:16 — 👍 0    🔁 0    💬 1    📌 0

#balisage HB:
Extension functions, e.g.
freq() => frequency distribution with counts
path-content-ec() => all paths in nodes with counts
resolve-link() => from content to file

06.08.2025 15:12 — 👍 0    🔁 0    💬 1    📌 0