Skip to Content.
Sympa Menu

coq-club - Re: [Coq-Club] Unicode tokens?

coq-club AT inria.fr

Subject: The Coq mailing list

List archive

Re: [Coq-Club] Unicode tokens?


Chronological Thread 
  • From: Vadim Zaliva <vzaliva AT cmu.edu>
  • To: coq-club AT inria.fr
  • Subject: Re: [Coq-Club] Unicode tokens?
  • Date: Wed, 8 Mar 2017 10:58:57 -0800
  • Authentication-results: mail2-smtp-roc.national.inria.fr; spf=None smtp.pra=vadim.zaliva AT west.cmu.edu; spf=Pass smtp.mailfrom=vadim.zaliva AT west.cmu.edu; spf=None smtp.helo=postmaster AT mail-wr0-f182.google.com
  • Ironport-phdr: 9a23:UIVOkh2Vu0De0ETAsmDT+DRfVm0co7zxezQtwd8ZsesQLfad9pjvdHbS+e9qxAeQG96KtrQY16GP6vyoGTRZp83e4DZaKN0EfiRGoPtVtjRoONSCB0z/IayiRA0BN+MGamVY+WqmO1NeAsf0ag6aiHSz6TkPBke3blItdaymUrLV2s+wzqW5/4DZSwROnju0J71ofzusqgCEiMAagI4qGK81ywXA6i9Wae1SxH1hDVmWglDx6trmr80ryDhZp/90r50Iaq79ZaltFbE=


On Wed, Mar 8, 2017 at 10:23 AM, Matej Kosik <5764c029b688c1c0d24a2e97cd764f AT gmail.com> wrote:
I guess that it is not the responsibility of the "grep" and friends to enable you to type unicode characters.
(it is the job of programs providing "input method", I guess, such as "xim", "uim" and many others).

Of course. As somebody who worked on Unicode support for one of Java compilers, I have also seen some other issues. For example, what is the default file encoding of .v files? Is it ASCII of Unicode? If Unicode what is the encoding (UTF-8/UTF-16/etc.). Could BOM be present? Is it mandatory? Is compiler output properly preserves Unicode in error messages? If in XML protocol used by IDE character offsets to source files are used, it should be understood that even in UCS-2 some characters can take more than 2 bytes (see "surrogates"), etc.


--
CMU ECE PhD candidate
Mobile: +1(510)220-1060




Archive powered by MHonArc 2.6.18.

Top of Page