So if you are currently involved with ISA-level decisions about inclusion of any pext/pdep-like instructions:
Please consider including SAG/inverse-SAG with bit-reversal of the goats.
No matter which of the two implementation methods you are using: All you need to do is not mask the goat bits.
25.07.2025 23:30 โ ๐ 3 ๐ 3 ๐ฌ 0 ๐ 0
So if you are currently involved with ISA-level decisions about inclusion of any pext/pdep-like instructions:
Please consider including SAG/inverse-SAG with bit-reversal of the goats.
No matter which of the two implementation methods you are using: All you need to do is not mask the goat bits.
25.07.2025 23:30 โ ๐ 3 ๐ 3 ๐ฌ 0 ๐ 0
At some point that patent will expire, and until then there's my implementation.
(And I think my approach will still stay relevant after, because it makes it very simple to build multi-cycle SAG cores.)
25.07.2025 23:30 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
I think the really important takeaway is that they, my method and theirs, are both functionally the same, i.e. they both implement an SAG with bit-reversal of the goats.
This means, as far as the ISA is concerned, it's a safe decision to include an SAG instruction.
25.07.2025 23:30 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Wordle 1,498 4/6*
๐จโฌโฌโฌโฌ
โฌ๐ฉ๐จโฌ๐จ
๐ฉ๐ฉโฌโฌ๐ฉ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
... back to normal ^__^
25.07.2025 23:18 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Nah! No luck. ๐ I was unsure which of the 4 options I could come up with to pick, and as it turned out, it was none of them... ๐
But I'm still at 99% and that's all that really matters to me. (But tbh, it does matter to me way more than it probably should..)
25.07.2025 08:57 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Wordle 1,497 X/6*
โฌโฌ๐จ๐ฉโฌ
โฌ๐ฉโฌ๐ฉ๐ฉ
โฌ๐ฉโฌ๐ฉ๐ฉ
โฌ๐ฉโฌ๐ฉ๐ฉ
โฌ๐ฉโฌ๐ฉ๐ฉ
... Wahhh! I have ~23 hours to decide on a last guess... /o\
24.07.2025 23:20 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
Here is the code:
github.com/clairexen/ed...
I can't see any obvious reason why that identity should not extend beyond 8-bit units. But I have not actually tested that hypothesis yet.
24.07.2025 12:38 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
I have to make a correction regarding โ๏ธ. I've now implemented the Hilewitz-Lee method as well in my edu-sag repository. And it implements the bit-reflecting-SAG as-is. All you have to do is to remove the '&ci' from the data input, thus it's always more area to implement PEXT than bit-reflecting-SAG.
24.07.2025 12:38 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
Here is the code:
github.com/clairexen/ed...
I can't see any obvious reason why that identity should not extend beyond 8-bit units. But I have not actually tested that hypothesis yet.
24.07.2025 12:38 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
I have to make a correction regarding โ๏ธ. I've now implemented the Hilewitz-Lee method as well in my edu-sag repository. And it implements the bit-reflecting-SAG as-is. All you have to do is to remove the '&ci' from the data input, thus it's always more area to implement PEXT than bit-reflecting-SAG.
24.07.2025 12:38 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
Wordle 1,483 4/6*
๐ฉโฌ๐ฉโฌโฌ
๐ฉโฌ๐ฉ๐จ๐ฉ
๐ฉ๐ฉ๐ฉโฌ๐ฉ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
11.07.2025 09:22 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
Wordle 1,482 6/6*
โฌโฌโฌโฌโฌ
โฌโฌโฌโฌโฌ
โฌโฌ๐ฉ๐ฉ๐ฉ
โฌ๐ฉ๐ฉ๐ฉ๐ฉ
โฌ๐ฉ๐ฉ๐ฉ๐ฉ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
.. that was a bit unusual.
but I still got it in the end ^__^
09.07.2025 22:51 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
Wordle 1,474 4/6*
โฌ๐จโฌ๐จโฌ
๐จ๐จ๐จโฌโฌ
๐ฉ๐ฉโฌ๐จโฌ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
02.07.2025 05:03 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
Wordle 1,473 3/6*
โฌโฌโฌ๐จโฌ
๐ฉ๐ฉโฌ๐จ๐จ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
01.07.2025 08:04 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
Wordle 1,462 5/6*
โฌโฌ๐ฉโฌโฌ
โฌโฌ๐ฉโฌ๐ฉ
โฌโฌ๐ฉ๐จ๐ฉ
โฌ๐จ๐ฉ๐ฉ๐ฉ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
20.06.2025 17:19 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
Wordle 1,460 3/6*
โฌโฌโฌโฌ๐จ
๐ฉโฌ๐จ๐จโฌ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
... that one was fun
18.06.2025 17:45 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
To have trans friends is to hold candles in the wind.
I canโt say if the storm is fate or man-made.
I only know Iโm losing light.
09.06.2025 20:32 โ ๐ 5 ๐ 0 ๐ฌ 0 ๐ 0
Wordle 1,449 4/6*
๐จโฌ๐จโฌโฌ
โฌ๐ฉ๐จโฌ๐จ
๐ฉ๐ฉ๐จ๐จโฌ
๐ฉ๐ฉ๐ฉ๐ฉ๐ฉ
07.06.2025 09:00 โ ๐ 1 ๐ 0 ๐ฌ 3 ๐ 0
For R-type instructions there's not much pressure there...
06.06.2025 18:18 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
The wire crossings are there either way. It's literally the same network graph, just organized differently in the HDL, so that it's easier to see how similar the three stages of circuit are.
06.06.2025 12:08 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
..cost of the mask operation. At least IMO.
Thought-Experiment:
Would you rather have
1) SAG plus MaskedSAG, or
2) SAG plus all 4 PACK Instructions?
I'd certainly pick 2) and for most architectures it's even cheaper in terms of hardware cost than 1)... ๐ค
06.06.2025 12:04 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
But isn't it also "basically free" in software? It's literally just one instruction. ;)
On a superscalar OOO machine I think it'd be unlikely that one is bottle-necked by the additional mask op.
For an instruction like this one, what really matters is the avoided cost of ~500 cycles, not the..
06.06.2025 12:04 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
GitHub - clairexen/edu-sag: Educational 8-Bit Sheep-And-Goats (SAG) Verilog Reference IP
Educational 8-Bit Sheep-And-Goats (SAG) Verilog Reference IP - clairexen/edu-sag
Yes, I mean the definition of SAG with the mirroring. The non-mirrored version is way more expensive (and IMO actually less desirable anyway).
I just wrote this:
github.com/clairexen/ed...
I've been meaning to write this code for quite some time...
06.06.2025 09:27 โ ๐ 1 ๐ 1 ๐ฌ 2 ๐ 0
And those symmetries then can be exploited for things from re-use of completed layouts for functional units, to creating different multi-cycle versions of SAG HDL cores.
06.06.2025 09:27 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
I think it's better to use just the first stage of the (inverse) butterfly network and unshuffle, like I did it here, than to use an actual (inv) butterfly network, because this way it's much easier to understand all the symmetries within such an SAG HW implementation.
06.06.2025 09:27 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
GitHub - clairexen/edu-sag: Educational 8-Bit Sheep-And-Goats (SAG) Verilog Reference IP
Educational 8-Bit Sheep-And-Goats (SAG) Verilog Reference IP - clairexen/edu-sag
Yes, I mean the definition of SAG with the mirroring. The non-mirrored version is way more expensive (and IMO actually less desirable anyway).
I just wrote this:
github.com/clairexen/ed...
I've been meaning to write this code for quite some time...
06.06.2025 09:27 โ ๐ 1 ๐ 1 ๐ฌ 2 ๐ 0
If you don't do the (infamously patented) pruning of the prefix sum network, then SAG is the natural behavior of that kind of bfly/ibfly based pext/pdep Implementation.
Afair the Hilewitz06 paper only adds that you can get rid of some of the half adders if you don't care about goat bits order.. ๐ค
04.06.2025 21:48 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
I am a proud parent of 3 amazing humans, a trans lesbian with ASD, a sci-fi/horror writer, a social worker and an outspoken loud mouth for human rights and social justice. I love comics, movies, books and generally nerding out. Absolutely no DMs.
For the hardware girlies. ๐ฉ๐พโ๐ปโ๏ธ
DIY tech | Electronics | Coding
www.prettywiredbuilds.com
Code, Design, OpenSource https://exactcode.com https://t2sde.org https://exactscan.com https://ocrkit.com https://rene.rebe.de
The crafting side of @nilasae.bsky.social
DUNKEL = dark
BUNT = colorful
๐ณ๏ธโ๐
#Goth #Knitting #GothKnitting #Crochet #Needlepoint #Creative #Crafting
youtube.com/@dunkelbunt_creations
I wanna make art but all I have is this electrical engineering degree.
He/him
Engineer, roboethicist and pro-feminist. Interested in robots as working models of life, evolution, intelligence and culture. Prof Robot Ethics, Bristol Robotics Lab. Home page: https://people.uwe.ac.uk/Person/AlanWinfield
Mother, Gamer ๐ฎ, DJ ๐๏ธ and Nerd ๐ค. Openly Bisexual. Neurodiverse. Socially Awkward and Autistic. Carolina Panthers ๐. IT Guru ๐ป. Medically disabled. EDS. POTS. GP. Ambulatory ๐งโ๐ฆผ user. Medical advocate. DV Survivor. College Student ๐ โฉ Medical Lab Tech ๐ฌ๐งช๐งซ๐ฅผ.
Engineer; Debian & FreeCAD developer, MS CS '25 @ UIS
Things hacker, shit poaster
Hope I can help โฅ๏ธ
Clocks are a scam
๐ซ๐ซ๐ซDMs open๐ซ๐ซ๐ซ๐ซDo message ๐ซ๐ซ
she/they mafia ๐ณ๏ธโโง๏ธ
๐บ๐ธ/๐ท๐บ location: EU
Before a fight, you build.
๐ป Data Scientist || ๐งฎ Stats Engineer || ๐ฅ DevRel taipy.io || ๐ฆ Community Growth || open-source passionate || ๐งณ Travel || ๐ธ Photography
ML researcher @AMD and full stack engineer of a different kind - FPGA compilers, quantized deep learning. ๐ณ๏ธโ๐๐จ๐พ๐น๐ท in ๐ณ๐ด. Cat dad and freediver.
The Open Source Hardware Association (OSHWA) aims to foster technological knowledge and encourage research that is accessible, collaborative and respects user freedom.
Open Healthware Conference Aug 1 & 2in NYC & Online! https://healthware.oshwa.org/
Videos when I darn well please, 23 he/him. Hatter185 on the bird place.
Researcher in computational physics,,
Scientific director of Program Inria Quadrant
2018-2022 director of Inria Nancy G-Est,
ERC GOODSHAPE-VORPALINE,
#riscv ambassador
Amateur Radio | Formal Methods | Japan
๐ก FT8, learning CW | FT-891 & dual FT-818s for satellite ops
๐ฌ Yosys & Coq for hardware verification
๐ค Prolog & guarded command languages enthusiast
โฉ๏ธ Wakayama & Nara
๐ก Canterbury Shaker Village