<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">Hi All,<div class=""><br class=""></div><div class="">As far as WebLicht is concerned, the name for the input/output type feature is just that, a name. So when people want to introduce a new type it is left up to them what they want to call it, and I think the names suggested are good. </div><div class=""><br class=""></div><div class="">However, changing the TCF type feature name may cause some serious backwards-compatibility issues, especially when using WaaS (WebLicht as a Service). Chains created with the “old” name probably will not work anymore.</div><div class=""><br class=""></div><div class="">So, regarding the new names, WebLicht is fine with whatever the web service developers decide on. But with regard to renaming the TCF type feature I think it may be problematic and I tend to think that it’s probably not worth it.</div><div class=""><br class=""></div><div class="">Best Regards,</div><div class="">Marie</div><div class=""><br class=""></div><div class=""><br class=""><div><blockquote type="cite" class=""><div class="">On 12.07.2016, at 10:14, Thomas Schmidt <<a href="mailto:thomas.schmidt@ids-mannheim.de" class="">thomas.schmidt@ids-mannheim.de</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class="">Dear all,<div class=""><br class=""></div><div class="">thanks, Dieter and Piotr, for pointing out the application vs. text issue. It seems clear that "application/" is the preferable variant, so we'll change that in our web services accordingly (might take a while due to holidays). 
</div><div class=""><br class=""></div><div class=""><span style="font-size:12.8px" class="">I think I'd like to carefully disagree regarding "existing" mime types:</span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class="">> We risk here to "invent" </span><span style="font-size:12.8px" class="">a new standard where some practice is already used in the wild.</span><br class=""></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class="">That "some" practice is used "in the wild" is, in my eyes, a strong reason for, rather than against, attempting to agree on a standard. As far as I can see, the use of mime types in CLARIN so far has been largely uncoordinated. If we could confirm that "</span><span style="font-size:12.8px" class="">text/exb+xml" and "text/tcf" are indeed the only mime types currently in use for EXMARaLDA and TCF files respectively, *and* if they were in use in more than one place, this might be a reason for accepting them as a de facto standard-like practice. I doubt that we can and that they are, though. And if we find the need to agree on something and to change some existing data accordingly, that something might as well follow the same logic as the TEI format variant mime types.</span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class="">I don't have any strong passions regarding the "tokenized" parameter issue. For the TEI/ISO transcriptions, we could live with both approaches. I am a bit worried, though, that further such distinctions ("normalized"? / "lemmatized"? / "tagged"?) 
might lead to an uncontrollable proliferation of mime-type-format variants, where the aim of the TEI standardisation was actually to reduce variation.</span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class="">Best regards,</span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class="">Thomas</span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div><div class=""><span style="font-size:12.8px" class=""><br class=""></span></div></div><div class="gmail_extra"><br class=""><div class="gmail_quote">On Mon, Jul 11, 2016 at 7:10 PM, Piotr Bański <span dir="ltr" class=""><<a href="mailto:banski@ids-mannheim.de" target="_blank" class="">banski@ids-mannheim.de</a>></span> wrote:<br class=""><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Dear Dieter,<br class="">
<br class="">
Thank you so much for this catch. Indeed, on the one hand it's RFC 6129, and on the other, it's the "Architecture..." spec (in the previously quoted fragment [1]), that call for application/ in all cases. I'll modify this now in the wiki.<br class="">
<br class="">
[1]: <a href="https://www.w3.org/TR/webarch/#xml-media-types" rel="noreferrer" target="_blank" class="">https://www.w3.org/TR/webarch/#xml-media-types</a><br class="">
<br class="">
Best regards,<br class="">
<br class="">
P.<div class="HOEnZb"><div class="h5"><br class="">
<br class="">
On 11/07/16 17:58, Dieter Van Uytvanck wrote:<br class="">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
On 08/07/16 17:42, Piotr Bański wrote:<br class="">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
I have summarized Thomas's proposal at<br class="">
<a href="https://trac.clarin.eu/wiki/MIME%20format%20variants" rel="noreferrer" target="_blank" class="">https://trac.clarin.eu/wiki/MIME%20format%20variants</a><br class="">
</blockquote>
Thank you Piotr!<br class="">
<br class="">
Great to see this discussion moving forward. Before I start editing the<br class="">
wiki, let me first check and mention a few points:<br class="">
<br class="">
- right now it states "text/tei+xml" as mimetype for TEI; shouldn't that<br class="">
be "application/tei+xml" ?<br class="">
<br class="">
- we have a CMDI component where we have gathered (at least a subset of)<br class="">
CLARIN-relevant mime types:<br class="">
<br class="">
<a href="https://catalog.clarin.eu/ds/ComponentRegistry#/?itemId=clarin.eu%3Acr1%3Ac_1271859438106&amp;registrySpace=public" rel="noreferrer" target="_blank" class="">https://catalog.clarin.eu/ds/ComponentRegistry#/?itemId=clarin.eu%3Acr1%3Ac_1271859438106&amp;registrySpace=public</a><br class="">
<br class="">
It is not complete, but I will add it as a starting point to the trac<br class="">
(might take me a few days with the DH conference coming up)<br class="">
<br class="">
- I agree that the "format-variant=" is a necessary and elegant solution<br class="">
in case of e.g. the mimetype "application/tei+xml".<br class="">
<br class="">
- I am not sure about using this approach for general XML-based formats<br class="">
where no disambiguation on top of the mimetype is strictly necessary,<br class="">
since some form of mimetype is already in use (e.g. "text/tcf+xml", see<br class="">
<a href="https://vlo.clarin.eu/?q=text/tcf" rel="noreferrer" target="_blank" class="">https://vlo.clarin.eu/?q=text/tcf</a> or "text/exb+xml", see<br class="">
<a href="https://vlo.clarin.eu/search?q=text/exb%2Bxml" rel="noreferrer" target="_blank" class="">https://vlo.clarin.eu/search?q=text/exb%2Bxml</a>). We risk here to "invent"<br class="">
a new standard where some practice is already used in the wild.<br class="">
<br class="">
- Optional parameter(s) like "tokenized=0/1" were seen as problematic<br class="">
when discussing these with some of our developers - they can lead to<br class="">
arbitrary and unpredictable combinations. Maybe we can use something<br class="">
like "application/tei+xml;format-variant=dta-tokenized" instead? A major<br class="">
advantage of a finite list of format variants is that we can document<br class="">
every variant, e.g. with a link to an example file.<br class="">
<br class="">
best regards,<br class="">
</blockquote>
<br class="">
<br class="">
-- <br class=""></div></div><div class="HOEnZb"><div class="h5">
Piotr Bański, Ph.D.<br class="">
Senior Researcher,<br class="">
Institut für Deutsche Sprache,<br class="">
R5 6-13<br class="">
68-161 Mannheim, Germany<br class="">
<br class="">
</div></div></blockquote></div><br class=""><br clear="all" class=""><div class=""><br class=""></div>-- <br class=""><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr" class=""><div class="">Thomas Schmidt<br class="">IDS Mannheim<br class="">R5, 6-13<br class="">D-68161 Mannheim<br class="">Tel.: +49 (621) 1581-313<br class=""><a href="http://agd.ids-mannheim.de/index.shtml" target="_blank" class="">http://agd.ids-mannheim.de/index.shtml</a><br class=""><a href="http://www.exmaralda.org/" target="_blank" class="">http://www.exmaralda.org</a></div></div></div>
</div>
</div></blockquote></div><br class=""></div></body></html>