[Standards] https://www.clarin.eu/content/standards-and-formats

Piotr Bański banski at ids-mannheim.de
Tue Jul 4 00:02:30 CEST 2017


Dear Dieter and Florian,

Some of this work is underway and, having just come back to the office 
after two weeks of absence, I intend to focus further on this issue this 
week. The "Benutzerhandbuch" will also be used for this, next to some 
other resources that are being remodelled. I will make sure to keep 
Florian in the loop on that.

Dear Florian, thanks a lot for the feedback!

Best regards,

   Piotr


On 07/03/17 17:09, Dieter Van Uytvanck wrote:
> Dear Florian,
>
> thank you very much for the feedback. I fully agree it would be good to
> include recommendations on media files and annotation formats. Therefore
> I am including our standards committee, which is responsible for this
> matter.
>
> In the meantime there is also quite a good bottom-up list of formats as
> recommended by various CLARIN centres that are hosting
> multimodal/multimedia datasets:
>
> https://trac.clarin.eu/wiki/FormatRegistry
>
> For those without access to the trac, I am including them inline:
>
> http://www.phonetik.uni-muenchen.de/Bas/BasFormatseng.html
> http://sldr-test.lpl-aix.fr/phpwiki/index.php/Formats
> https://cocoon.huma-num.fr/exist/crdo/formats.htm
> http://fedora.clarin-d.uni-saarland.de/ressources/AcceptedFormats.en.pdf
> https://corpora.uni-hamburg.de/drupal/en/corpus-hosting
> http://www.mpi.nl/corpus/html/lamus2/apa.html
> http://agd.ids-mannheim.de/uebernahme.shtml
>
> (more for brainstorming:
> https://docs.google.com/spreadsheets/d/1Tjmp_sEZDHIqnFAU1erx2VtzyYdjbKdM7RQGNIUbWhs/edit)
>
>
> It would be great if these guidelines could be somehow integrated into
> the Standards and Formats page and the Standard guidance website.
>
> best,
> Dieter
>
>
> On 30/06/2017 09:56, Florian Schiel wrote:
>> This regards CLARIN-wide standards and formats:
>>
>> Looking at the overview
>> https://www.clarin.eu/content/standards-and-formats it is remarkable
>> that no standards/formats of media files or media annotation files are
>> included. I also looked at the more extensive documentation at
>> http://clarin.ids-mannheim.de/standards/ but to no avail, except for
>> the mention of ELAN's standard EAF for video annotation.
>>
>> The German CLARIN-D 'Benutzerhandbuch' contains a small but useful
>> chapter 'Multimodal Corpora'
>> (https://www.clarin-d.net/de/sprachressourcen-und-dienste/benutzerhandbuch)
>> that could be adopted ...
>>
>> I regard this as important, since we receive more and more 'corpus data'
>> from users in proprietary file formats that cannot even be converted
>> outside certain OS or software environments.
>>
>> Maybe you can forward this to the authors of the overview, thanks.
>>
>>
>> Florian Schiel
>

-- 
Piotr Bański, Ph.D.
Senior Researcher,
Institut für Deutsche Sprache,
R5 6-13
68-161 Mannheim, Germany


