From a59daedb44535c9b4be8ac9d9dbd26d61c265d4a Mon Sep 17 00:00:00 2001 From: bansp Date: Thu, 7 May 2026 03:04:32 +0200 Subject: [PATCH 1/4] add an extDoc --- SIS/clarin/data/formats/fODF.xml | 1 + 1 file changed, 1 insertion(+) diff --git a/SIS/clarin/data/formats/fODF.xml b/SIS/clarin/data/formats/fODF.xml index cacb4758..f56dafe2 100644 --- a/SIS/clarin/data/formats/fODF.xml +++ b/SIS/clarin/data/formats/fODF.xml @@ -9,6 +9,7 @@ umbrella format OASIS fdd000247 + OpenDocument

"The Open Document Format for Office Applications (ODF), also known as OpenDocument, standardized as ISO 26300, is an open file format for word processing documents, spreadsheets, presentations and graphics and using ZIP-compressed XML files. From cc90ad3b6cc581d4abcd5d73691eb134136c70b6 Mon Sep 17 00:00:00 2001 From: bansp Date: Sat, 23 May 2026 03:25:30 +0200 Subject: [PATCH 2/4] add extDoc and relations whose targets are not there yet (cf #479) --- SIS/clarin/data/formats/fCSV.xml | 3 +++ SIS/clarin/data/formats/fDICOM.xml | 10 +++++++--- SIS/clarin/data/formats/fTextPlain.xml | 1 + 3 files changed, 11 insertions(+), 3 deletions(-) diff --git a/SIS/clarin/data/formats/fCSV.xml b/SIS/clarin/data/formats/fCSV.xml index 16c2e306..ed27a58f 100644 --- a/SIS/clarin/data/formats/fCSV.xml +++ b/SIS/clarin/data/formats/fCSV.xml @@ -8,6 +8,8 @@ tabular format W3C fdd000323 + Comma-separated_values + CSV

"CSV is one of the most popular formats for publishing data on the web. It is concise, easy to understand by both humans and computers, and aligns nicely to the tabular nature of most @@ -38,6 +40,7 @@

  • CSV Dialect -- a set of modelling parameters for describing various dialects of CSV
  • + text/csv .csv Plain.Delimited diff --git a/SIS/clarin/data/formats/fDICOM.xml b/SIS/clarin/data/formats/fDICOM.xml index 50251358..b5f33151 100644 --- a/SIS/clarin/data/formats/fDICOM.xml +++ b/SIS/clarin/data/formats/fDICOM.xml @@ -11,16 +11,20 @@ Q28205908 fmt/574 + DICOM + DICOM -

    See https://en.wikipedia.org/wiki/DICOM.

    Please feel welcome to supply the description of this format file via GitHub: either as an issue report, or as a pull request after forking or browsing the code under the 'formats' branch.

    +

    Standardised both by ISO (ISO 12052) and NEMA (National Electrical Manufacturers + Association). (The standards relation link looks weird pending work on the proper visualisation. + Feel welcome to join us.)

    - application/dicom .dcm diff --git a/SIS/clarin/data/formats/fTextPlain.xml b/SIS/clarin/data/formats/fTextPlain.xml index cb25fd50..5c9378eb 100644 --- a/SIS/clarin/data/formats/fTextPlain.xml +++ b/SIS/clarin/data/formats/fTextPlain.xml @@ -7,6 +7,7 @@ text format Plain_text + Plain_text

    Plain text is a pure sequence of character codes. (...) Plain text represents character content only, not its appearance. (...) Plain text must contain enough information to permit From 88fa1a1f69809fa389ab2c59a19bdf0a5ecc3f25 Mon Sep 17 00:00:00 2001 From: bansp Date: Mon, 1 Jun 2026 16:36:58 +0200 Subject: [PATCH 3/4] info provided by @riccardodg + explanation of why the red notice is still there; closes #490 --- .../ILC4CLARIN-recommendation.xml | 77 ++++++------------- 1 file changed, 22 insertions(+), 55 deletions(-) diff --git a/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml b/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml index ffc84cff..3c877442 100644 --- a/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml +++ b/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml @@ -20,81 +20,48 @@ - +

    Formats extracted from the uploaded dataset and matched against + CLARIN Standards recommendations.

    +

    This list has been submitted by Riccardo Del Gratta in May 2026.

    +

    No active curator has yet been appointed for the recommendations.

    - - Audiovisual Source Language Data - recommended - - - Audiovisual Source Language Data + + Metadata recommended - - Audiovisual Source Language Data + + Textual Source Language Data recommended - - Documentation + + Text Annotation recommended - + Documentation - recommended + acceptable Documentation acceptable - - Documentation - recommended - - - Documentation - recommended - - - Image Source Language Data - recommended - - - Image Source Language Data - recommended - - - Image Source Language Data - recommended - - - Text Annotation - recommended - - - Text Annotation - recommended - - - Textual Source Language Data - recommended - - + Textual Source Language Data acceptable - - Textual Source Language Data - recommended + + Metadata + acceptable - - Tool Support - recommended + + Documentation + acceptable - + Packaging - recommended + acceptable - + Packaging recommended From dbf3fc0674907bd88e8c5fb046b9bebd480f7845 Mon Sep 17 00:00:00 2001 From: bansp Date: Mon, 1 Jun 2026 16:39:57 +0200 Subject: [PATCH 4/4] last-commit value update --- SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml b/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml index 3c877442..4ab8240c 100644 --- a/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml +++ b/SIS/clarin/data/recommendations/ILC4CLARIN-recommendation.xml @@ -2,7 +2,7 @@
    - 147cb5ea659a5752efd819898b43da23e00c6a77 + 88fa1a1f69809fa389ab2c59a19bdf0a5ecc3f25 ILC4CLARIN