How *you* (librarians and those working in publishing, law, and government) should use ActivityPub and Why


This is a follow up post to How I use ActivityPub (and Why)

Some years ago, I discovered the opened linked data work of the UK’s Parliamentary Digital Service, when I was looking for librarians who were engaged in the practice of making and sharing weeknotes.

Anya and Michael have been making weekly-ish exceptional weeknotes since 2019. Their latest weeknote – 2025 – Week 40 – is a great place to start because it starts with a rationale of the why behind their ongoing project.

A possible misnomer / starting from first principles

Our dear reader will be aware that for the past several years we have been attempting to build an open data platform. Please believe us when we add, it’s felt longer on the inside. A series of meetings this week caused us to question not only how we were going about that but also to what purpose. These are some notes on that.

Unlike many organisations, the open part of the equation has never been a difficulty. Parliament, like everywhere else, has non-open data in the form of HR, finance and facilities. That’s not the flavour of data we’re talking about. Our concerns relate to the procedural and the procedurally adjacent. The openness is helped by the existence of the Open Parliament Licence – think of the Open Government Licence, with the addition of a portcullis or two. Which itself is not so different from the Creative Commons Attribution licence. In our context, openness is a solved problem.

In many ways, having a platform to publish that open data is also a solved problem. Parliament’s Developer Hub makes available a set of open APIs, all published under the aforementioned Open Parliament Licence. It could be said then, that Parliament already has an Open Data Platform, at which point, you may be wondering why build another one. Well …

The problem we’re battling is not openness but Conway’s Law. Parliament’s approach to internet connected computers has tended to concentrate on digitising existing office functions. An example: the Journal Offices have always been responsible for the receipt of papers laid. Back in the olden days, this took the form of bundles of papers being transported down Whitehall and into Westminster. Happily, the process has been digitised, meaning a lot less shoe leather is used up. All good stuff.

Less good is the system still only deals with that one function. It neither cares about nor implements description of the procedures that commence once a paper is laid. This is true for a myriad of other systems. Hansard knows what was said and who said it, but carries no semantics about the debate. It does not know that some contributions formed part of a second reading debate, nor which bill that debate was on. Motions may also be tabled electronically, but there is no data connecting those motions to any debate that ensued. The motion having been disposed of by division, the division system does not have any data describing the motion being divided on, nor the debate that led to the decision. Committee reports published along the way do not carry any data about what was being reported on.

For anybody attempting to follow the passage of a piece of legislation or a treaty through Parliament, you’d typically need some idea of which committees might take an interest, what days those committees tend to meet and where they publish their reports. This is quite an overhead for people inside Parliament, and still worse for the uninitiated.

And forgive me as I quote myself now:

As Tim Knight wrote in Break On Through to the Other Side: The Library and Linked Data (2011), linked and semantic data is well suited for the purposes of meaning making in the field of law.

What we are missing, and what the work leading towards the semantic Web is all about, is “meaning”.
Lack of Meaning
In a law library “meaning” might look like this. When searching for a particular legislative act one
might expect to find, in addition to commentary written about the legislation, the associated
regulations, or each of the readings of the bill as it went through the House or Senate, or the
parliamentary debates that occurred with maybe the video or audio recordings as well as the print
transcripts. Maybe we'd want to focus on a particular role that an author had, for example as a primary
or secondary author in a law review article, or as the editor of a collection of continuing legal education
papers, or the commentator on a particular area of the law. But if these roles have not been identified
by the information system we will not be able to search for them.
And then there are any number of other meaningful relationships between resources that we may not
even be aware that we want. Robert Darnton offers a nice image when speaking about his archival
research:
“The manuscripts seem to stretch into infinity. You open a box, take out a folder, open the folder, take
out a letter, read the letter, and wonder what connects it with all the other letters in all the other folders in
all the boxes, not just in this repository but in all the archives everywhere.” 9
Our information systems should make these connections or at least allow researchers to add and share
the necessary bread crumbs that make meaningful connections as they do their research.

This is not a post about open linked data.

For today’s post, I would like the reader to pay attention to a particular bit of the work of the PDS: the creation of a small array of Mastodon accounts so those who are interested in tracking a topic or legislation can do so easily. They are found on this page: https://ukparliament.github.io/ontologies/meta/bots/

ontologies
Bot accounts from the House of Commons Library
Research services
Bluesky 	Mastodon 	Feed
@commonslibrary 	@commonslibrary 	RSS
@post-parliament 	@PostParliament 	RSS
Made-n-laid

Made-n-laid posts whenever a made statutory instrument is laid before Parliament.
Bluesky 	Mastodon
@madenlaid 	@madenlaid
Tweaty twacker

Tweaty twacker posts whenever a treaty subject to the procedure set out in the Constitutional Reform and Governance Act 2010 is laid before Parliament.

I would like to see more sets of pages that allow interested users to be kept up to date on recent events, publications, or accessions using ActivityPub.


There is no reason to be stingy with the creation of ActivityPub feeds.

So says Terrance Eden in his post The Fediverse of Things and I agree.

One of the most frustrating things in modern technology is the effort spent trying to artificially restrict abundance.

Take, for example, this tale from museum-worker Aaron Cope:

I was out with a friend who worked for Twitter and I asked them whether it would be possible for the museum to “create 200,000 Twitter accounts, one for each object in the Cooper Hewitt’s collection”. My friend looked at me for a moment, laughed, and then simply said: No.

In that blog post, Aaron reveals that the San Francisco International Airport Museum is using ActivityPub to create automated social-media bot accounts for all its exhibits and, possibly, every object it hold.

And why not! That would be close to impossible to do on a centralised service. But on a decentralised service under your own control, it is relatively simple. Perhaps I only want to follow the museum’s canteen, or I just want to engage with a specific artefact. The Fediverse makes that possible.

This reminds me of the Melbourne “treemail” phenomenon. Every tree in the city had an email address, ostensibly so residents could email maintenance issues for a specific tree. Instead, people started interacting with the trees and sending them little love notes!

Despite the forces that be that keep trying to turn the internet into TV, the internet is still a network and not a broadcasting service.

And yet the Canadian Broadcasting Corporation still holds on to an old-media paradigm that the only way one should deliver content is directly to only the largest audience possible. At least, reason that they give to explain why they remain on X.

The CBC acts as if hosting a dedicated Mastodon server that is bridged to Bluesky and other ActivityPub servers is something they could not possibly resource, when a service for their journalists could cost about $50 a month.

The representatives of the CBC do not address that the social media platform of X is literally dangerous to the well-being of millions of Canadians who are supposed to be broadcast to by the CBC.

It would do a world of good of those involved in media and social media had a better understanding of the standards and the protocols that are needed to make social networks safer for more people.

And for them, I have such a book recommendation for the occasion: Robert W. Gehl‘s Move Slowly and Build Bridges.

In his new book Move Slowly and Build Bridges, Robert W. Gehl tells the story of the activists, software developers, artists, and everyday people who have built the fediverse, a noncentralized alternative social media system. Unlike big tech corporations like Facebook, TikTok, or X, the fediverse is comprised of thousands of small, independent communities who use a Web protocol to communicate with one another.

I’ve not finished the book yet but I’m almost finished. I particularly appreciated learning more about how the standard of ActivityPub came about and how messy and miraculous it was and remains.

Fediverse Reactions

Leave a Reply

Your email address will not be published. Required fields are marked *