Three structural reasons digital archives fail communities
1. Search that doesn't understand people
Most archive search is keyword-based. You type a word, the system finds documents that contain that word. This sounds reasonable until you realise communities don't always know the right words. A family looking for records of their grandmother's work in the garment trade might search "Caribbean seamstresses in Hackney" and find nothing, even if the archive is full of relevant material filed under terminology a curator chose decades ago.
Semantic search changes this fundamentally. Rather than matching what you typed, it understands what you meant: surfacing material by concept and intent, not by literal string. A community member searching in plain language can find material that keyword search would bury. This isn't a marginal improvement. It's the difference between an archive that works and one that gatekeeps.
The catch is that semantic search has historically required significant infrastructure. Which is to say, it has been available to large institutions, not to the community organisations that need it most. Until now, that gap has been almost impossible to close with community budgets.
2. No provenance, no trust
Material gathered from communities, rehoused somewhere the originating community can no longer easily access, recatalogued in institutional terms, used in applications the community never consented to. The pattern is well-documented and it is still happening. Archives built on that legacy carry the weight of it, regardless of current intentions.
“Communities have good reason to be wary of institutions that hold their history. Extraction isn't a historical artefact: it's a current practice.
When a digital archive can't show you who contributed something, why it was collected, and who controls access to it, trust breaks down. And trust, once broken around heritage and history, is very difficult to rebuild.
Citable provenance isn't just a metadata question. It's an accountability structure. An archive that tells you where something came from, who contributed it, and what conditions govern its use is an archive that can be trusted. One that can't is asking communities to take institutional good faith on faith. Most communities have learnt not to.
The most misleading word in heritage technology is "preservation." It implies neutrality. But every decision about what gets preserved, how it is described, and who can access it is a political decision. Archives governed by institutions will reflect institutional priorities, however well-meaning those institutions are.
Community-controlled archives aren't a nice-to-have feature. They are a structural requirement if an archive is going to mean something to the people whose history it holds. This means communities deciding what goes in. Communities describing their own material, in their own terms. Communities setting access conditions. And communities retaining the right to say no: to extraction, to datafication, to institutional use cases that were never theirs.
“Control and access are not the same thing. Institutions have been offering communities access to their own history for a long time. That's not the same as giving them ownership of it.