help-circle
rss
GCHQ's CyberChef – use it for creating and scrambling (unit) test data!
While working on parsing news articles, that I cannot include with the code because they are copyrighted, I just realised that there are online tools to help "scramble" the contents so that the shape is the same. I'm still looking into it, but GCHQ's CyberChef looks very useful here.

A survey to understand how researchers use metadata for creating literary corpora

OpenAtlas is an open source database software developed especially to acquire, edit and manage research data from various fields of humanities like history, archaeology and cultural heritage…
First time I hear about OpenAtlas, which appears to be a competitor to [Nodegoat](https://nodegoat.net) and I guess [ResearchSpace](https://researchspace.org).

> My PhD supervisor once told me that everyone doing newspaper analysis starts by writing code to read in files from the 'LexisNexis' newspaper archive. However, while I do recommend this exercise, not everyone has the time.


An inventory of tools for converting data to RDF
FAIR does not imply the use of Linked Data or RDF, but it does seem useful. This is not just a list of tools, there is more context to most of them.

Allow the packet buffer size to be overridden by know-it-alls.
There are many interesting comments in this driver file.

I'm glad that when I thought "hmm, maybe I can optimise my code that generates IIIF annotations with [Pydantic](https://docs.pydantic.dev)", I remembered that I don't have to build my own `Model`s :D

SWIB23 Programme
Lots of interesting talks and workshops on the programme for Semantic Web in Libraries 2023!
2

These guidelines look simple and make your code not much more verbose.

EUacademy: Comparing Wikibase and Semantic Mediawiki for practical Linked Data applications
You could also watch the recordings of the hybrid event, but there is a certificate to be earned here...

Basically an awesome list of things you shouldn't do on GitHub.

GraphDB Free includes [spaCy](https://spacy.io)

Generate wide ranges of test scenarios with the Hypothesis testing framework for Python and Java
I heard about [Hypothesis](https://hypothesis.works/) at the recent [NL-RSE meetup on testing](https://nl-rse.org/events/2023-06-15-meetup) and it looks very useful. Consider this a promise that I will try it soon.

Registry of Digital Public Goods
Better than an awesome list, these are real public goods. Open Source software, Open Data, created for the public.

> To merge, or not to merge? That is the question. Whether 'tis nobler in the mind to squash first, or to just rebase. Our engineer @abelsiqueira gives us some hints for better #Git workflows. via https://akademienl.social/@eScienceCenter/110586846000303720

A tool to simulate how people with different kinds of vision impairments see colour combinations. Source: https://front-end.social/@marcamos/110550002603409576

How to run CI/CD (a little) more efficiently to help reduce CO2 emissions. By the Netherlands [@eScienceCenter@akademienl.social](https://akademienl.social/@eScienceCenter)

ZAP is a toolkit for application security testing. I learned about it on the [SE Radio podcast #568](https://www.se-radio.net/2023/06/se-radio-568-simon-bennetts-on-owasp-dynamic-application-security-testing-tool-zap/) and feel I should have our IT/security run this on webservices we offer.

First Public Working Drafts: RDF 1.2 Semantics and SPARQL 1.2 Entailment Regimes | W3C News
Using triples as subject or object of another triple is possible with RDF 1.2.