Posts by RichardJActon@fosstodon.org
 (DIR) Post #AUzDWeDqXM1J7FRAMC by RichardJActon@fosstodon.org
       2023-04-24T16:58:05Z
       
       0 likes, 0 repeats
       
       @reality2cast Just listened to episode 144 and there's a great 'single source' publishing tool called Quarto https://quarto.org/ that can generate and HTML, pdf & ePub versions of your book from a markdown flavoured syntax. I've got gitlab CI configured so that if I push a new commit to my git repo it updates all the formats of the book automatically. I used it to write my first short tech book about research data: https://hdbi.gitlab.io/data-management/hdbi-data-resource/ directed at my colleagues in our research consortium.
       
 (DIR) Post #B5S3LJTGRLpxXvOX6O by RichardJActon@fosstodon.org
       2026-04-18T19:26:07Z
       
       0 likes, 0 repeats
       
       @cwebber @bkuhn @ossguy @richardfontana I'd don't see a great way out of the copyright stripping conclusions for them without changes to the law. As I understand their defense for training on copyrighted materials - it's predicated on the models being a "transformative" and not competing directly with the original works in the market. The models themselves don't compete with the training material only their outputs do - and the LLM companies want any liability for that to be on users not them.
       
 (DIR) Post #B5S3LKTehNkCfQaMLo by RichardJActon@fosstodon.org
       2026-04-18T19:31:09Z
       
       0 likes, 0 repeats
       
       @cwebber @bkuhn @ossguy @richardfontana Under this view it doesn't matter how the training data was licensed as it's a fair use defense. The outputs being uncopyrightable / effectively public domain allows people to claim they wrote it when it's convenient and they want to be able to copyright it as it's hard to prove if it was AI generated or human authored. And simultaneously to claim that it was the output of and LLM when they want to strip inconvenient licensing terms.