The modula-2.xml file available at: https://kate-editor.org/syntax/update-5.36.xml does not actually represent Modula-2. Whoever made this xml file must have copied it from some other language and then modified it but without knowing Modula-2. The file does not represent any dialect of Modula-2. Anyway, I have made a replacement xml file for the classic Modula-2 dialect as defined in Niklaus Wirth's "Programming in Modula-2", 4th edition (most recent) published by Springer, Heidelberg. I have also made a test source file and tested this. It can be downloaded from my Github repo at: https://github.com/trijezdci/M2-Syntax-Colour/tree/master/Kate Known Limitation: Modula-2 permits nesting of comments, but it is not clear to me how to do this with Kate's syntax highlighting specification model.
Hi, cool that you provide a corrected file. Two things: 1) Would you be ok to contribute this under the MIT license? We can replace the existing file with your one then. 2) For the nested comment, you might just need to add the <Detect2Chars attribute="Comment" context="InComment" char="(" char1="*" beginRegion="FoldableComment"/> inside your InComment context, too. That should have the wanted effect.
Nice, thanks. And also MIT licensed as we prefer. We'll have a look soon.
Note that there are three main dialects: (1) classic Modula-2, known as PIM, last revised in 1985; (2) ISO Modula-2, formally ISO 10514, last revised in 1992; and (3) Modula-2 R10, an up-to-date modernised revision based on PIM. I have now added two more xml files (and their respective .mod test files) for the other two dialects. The language names for these have suffixes so that they can all co-exist within Kate and show up together in the Kate language menu. The PIM dialect being the original has the highest priority, so it will take precedence when loading .def or .mod files in Kate. Ideally, Kate would select the dialect from a dialect tag (a special comment) within Modula-2 source files. We have done this for Emacs and Vim, and also for the Pygments source code rendering framework. It is probably doable in Kate as well, only defining those items that are common to all dialects and put those that are different into dialect specific contexts, then switch to those contexts when a dialect tag is found in the source code. I intend to try this at some point but don't have the time right now. Thus, for the time being, users will have to select the dialect manually from the Kate language menu. I will work on the dialect detection later.
(In reply to Christoph Cullmann from comment #1) > > 1) Would you be ok to contribute this under the MIT license? We can replace > the existing file with your one then. I had already put this under MIT license. I didn't put a license file into the repo because this repo will also have the syntax profiles for other editors and rendering frameworks and each project has different preference on licensing which I always follow. So I only put a line "licensed under the MIT license" into the comment header. However, I have now added a LICENSE file with the MIT license into the Kate subdirectory of the repo. https://github.com/trijezdci/M2-Syntax-Colour/blob/master/Kate/LICENSE > 2) For the nested comment, you might just need to add the <Detect2Chars > attribute="Comment" > context="InComment" char="(" char1="*" > beginRegion="FoldableComment"/> inside your InComment context, too. That > should have the wanted effect. Thanks, I will try that.
> > 2) For the nested comment, you might just need to add the <Detect2Chars > > attribute="Comment" > > context="InComment" char="(" char1="*" > > beginRegion="FoldableComment"/> inside your InComment context, too. That > > should have the wanted effect. > > Thanks, I will try that. Ok, that worked. Thanks. I have now incorporated this in all three xml files and added a nested comment into their respective test (.mod) files. With this fix for the nested comments, the files should now be ready for incorporating into the Kate code base.
Great. Actually the license file is not needed, what we want is some license="MIT" attribute in the XML file. (still the license file doesn't hurt). We can in cooperate your files into syntax-highlighting if you are ok, if you later have updates, you just can send a patch via phabricator.kde.org or a bug.
I have added the license field in the language tag in all three xml files. https://github.com/trijezdci/M2-Syntax-Colour/commit/91dddc4e20d2b7de6fb9b416d0b9532b5beddc4c I have also changed the mime type from x-modula-2 to x-modula2 as I think the former is incorrect. On my system the registered mime type for Modula-2 is x-modula2 and the Pygments plugin also has x-modula2. I have asked a friend to check on his system and there, too, the mime type was x-modula2. Looking at your list of xml files for syntax highlighting profiles, other than Modula-2, there are eight other instances of languages with a hyphen in the name: ANS-Forth94 => ansforth94.xml : mimetype="" B-Method => bmethod.xml : mimetype absent Component-Pascal => component-pascal.xml : mimetype="text/x-component-pascal" MS-DOS Batch => dosbat.xml : mimetype="application/x-dos" G-Code => gcode.xml : mimetype="text/ngc" Objective-C => objectivec.xml : mimetype="text/x-objc-src;text/x-c-hdr" Objective-C++ => objectivecpp.xml : mimetype="text/x-objcpp-src;text/x-c-hdr" POC-Ray => povray.xml : mimetype="text/x-povray-script;text/x-povray-include" Of these, six have mimetypes, five of those drop the hypen in the mimetype, only one preserves it. Component-Pascal appears to be the outlier. I therefore suggest we drop the hyphen in the mimetype of Modula-2 as I have (tentatively) done in the latest commit. If you see any issue with this, please let me know and I will put the hyphen back in. One could of course argue that if the hyphen is dropped in the mimetype, then it should also be dropped in the filename to be consistent. The reason, I kept the hyphen in the filename is because when I changed this, Kate did not override the old incorrect Modula-2 xml entry with the replacement, but it showed the old one in addition to the new ones in the language menu. So, I had to keep the filename the same to override the old one. Since there are already Kate installations our there which were shipped with this file, I do not see how the old one could be overridden by the new one without keeping the same filename.
> One could of course argue that if the hyphen is dropped in the mimetype, > then it should also be dropped in the filename to be consistent. The reason, > I kept the hyphen in the filename is because when I changed this, Kate did > not override the old incorrect Modula-2 xml entry with the replacement, but > it showed the old one in addition to the new ones in the language menu. So, > I had to keep the filename the same to override the old one. > > Since there are already Kate installations our there which were shipped with > this file, I do not see how the old one could be overridden by the new one > without keeping the same filename. Perhaps I should have mentioned that the old Modula-2 entry is remembered by Kate even though there is NO xml file for it on my system. In fact, the syntax directory at /usr/share/org.kde.syntax-highlighting/syntax was empty when I started working on this, now there are only the three files I made in it. Yet, Kate remembers the old one when I change the filename.
Can you quickly tell what the naming conventions for xml syntax files is? Can they have periods in them other than the final .xml? For example, could we use ... modula-2.xml modula-2.pim.xml modula-2.iso.xml modula-2.r10.xml ?
I think a period is no issue for the code that searchs/uses the files. Thought to be consistent with what we have I would propose to use - like others.
(In reply to Christoph Cullmann from comment #10) > I think a period is no issue for the code that searchs/uses the files. > Thought to be consistent with what we have I would propose to use - like > others. Fair enough. I have now completed and tested a new profile that detects the dialect automatically, either by dialect tag (a specially formatted comment), or by reserved words or built-in identifiers that are indicative of a specific dialect. This file should be the default, and I have thus given it the generic name modula-2.xml. The other files are nevertheless useful since they permit manual dialect selection via the Kate language menu. I have named them as follows: modula-2-pim-only.xml modula-2-iso-only.xml modula-2-r10-only.xml With these files installed, the Kate language menu shows the following: Modula-2 Modula-2 (ISO-only) Modula-2 (PIM-only) Modula-2 (R10-only) I have committed all files to the repo at github and updated the README accordingly. https://github.com/trijezdci/M2-Syntax-Colour/blob/master/Kate/README.md This is now definitely ready to go into the Kate distribution ;-) I may revisit this in the future to do more code folding (at present only comments are foldable) but as far as syntax highlighting goes, this is as polished as it gets and I have already spent more time than I had budgeted for this, so this is it for now. Thanks for your assistance.
Thanks! I will add the hls soon to our repository. If you have any updates afterwards, just submit them via phabricator.kde.org for syntax-highlighting repo, simple patch upload is enough.
Git commit ad66b982178590e472e1aa9922eeb356e79f5cc6 by Christoph Cullmann. Committed on 01/09/2018 at 16:18. Pushed by cullmann into branch 'master'. update Modula-2 highlighting A +96 -0 autotests/folding/test-iso.mod.fold A +90 -0 autotests/folding/test-pim.mod.fold A +112 -0 autotests/folding/test-r10.mod.fold A +96 -0 autotests/folding/test.mod.fold A +103 -0 autotests/html/test-iso.mod.html A +97 -0 autotests/html/test-pim.mod.html A +119 -0 autotests/html/test-r10.mod.html A +103 -0 autotests/html/test.mod.html A +96 -0 autotests/input/test-iso.mod A +1 -0 autotests/input/test-iso.mod.syntax A +90 -0 autotests/input/test-pim.mod A +1 -0 autotests/input/test-pim.mod.syntax A +112 -0 autotests/input/test-r10.mod A +1 -0 autotests/input/test-r10.mod.syntax A +96 -0 autotests/input/test.mod A +96 -0 autotests/reference/test-iso.mod.ref A +90 -0 autotests/reference/test-pim.mod.ref A +112 -0 autotests/reference/test-r10.mod.ref A +96 -0 autotests/reference/test.mod.ref A +342 -0 data/syntax/modula-2-iso-only.xml A +308 -0 data/syntax/modula-2-pim-only.xml A +347 -0 data/syntax/modula-2-r10-only.xml M +1112 -108 data/syntax/modula-2.xml https://commits.kde.org/syntax-highlighting/ad66b982178590e472e1aa9922eeb356e79f5cc6
Hi, I pushed your stuff. I used the iso test file as test file for the "standard" highlighting, too. I fixed 3 issues in the modula-2.xml: attribute="Const" instead of the not known attribute="Constant"