pax_global_header
52 comment=3f6c25df8ec987b7dcd31cbeea14a2807de3d056

hunspell-bo-0.4.0/

hunspell-bo-0.4.0/.gitignore

*.changes
*.xz
*.gz
*.deb
*.upload
*.dsc
debian/hunspell-bo/*
firefox/dictionaries/*
lo/dictionaries/*
*.xpi
*.oxt
*~
lo/README.md
tmp-tibetan-spellchecker-git

hunspell-bo-0.4.0/CHANGELOG.md

# Change Log
All notable changes to this project will be documented in this file.
As of v0.3.0 this project adheres to [Semantic Versioning](http://semver.org/). It follows [some conventions](http://keepachangelog.com/).

## [0.4.0] - 2017-08-27
### Added
- སྨྲོས
- སྨྲས
- སྨྲེ/C
- སྨྲེང
- སྨྲངས

## [0.3.0] - 2016-08-31
### Fixed
- ཧྤ -> ཧྥ

### Added
- more wasurs and འུ suffix possibilities
- rare affixed particle combination འིའོ

### Changed
- treat ཏྲ, མྲ, སྣྲ and སྨྲ as exceptions and list their possibilities

### Removed
- remove erroneous དཀླ, བཏྲ and གཏྲ

## [0.2.0] - 2015-08-15
### Changed
- contains all possible "legal" Classical Tibetan syllables (coming from research in grammar books), not limited to a dictionary, see [tibetan-spellchecker](https://github.com/eroux/tibetan-spellchecker)

### Added
- replacement proposals for archaic forms
- main proper name syllables (not including Sanskrit names)

## [0.1.0] - 2013-06-15
- initial release, contains syllables from the བོད་རྒྱ་ཚིག་མཛོད་ཆེན་མོ།.
hunspell-bo-0.4.0/LICENSE

CC0 1.0 Universal

Statement of Purpose

The laws of most jurisdictions throughout the world automatically confer exclusive Copyright and Related Rights (defined below) upon the creator and subsequent owner(s) (each and all, an "owner") of an original work of authorship and/or a database (each, a "Work").

Certain owners wish to permanently relinquish those rights to a Work for the purpose of contributing to a commons of creative, cultural and scientific works ("Commons") that the public can reliably and without fear of later claims of infringement build upon, modify, incorporate in other works, reuse and redistribute as freely as possible in any form whatsoever and for any purposes, including without limitation commercial purposes. These owners may contribute to the Commons to promote the ideal of a free culture and the further production of creative, cultural and scientific works, or to gain reputation or greater distribution for their Work in part through the use and efforts of others.

For these and/or other purposes and motivations, and without any expectation of additional consideration or compensation, the person associating CC0 with a Work (the "Affirmer"), to the extent that he or she is an owner of Copyright and Related Rights in the Work, voluntarily elects to apply CC0 to the Work and publicly distribute the Work under its terms, with knowledge of his or her Copyright and Related Rights in the Work and the meaning and intended legal effect of CC0 on those rights.

1. Copyright and Related Rights. A Work made available under CC0 may be protected by copyright and related or neighboring rights ("Copyright and Related Rights"). Copyright and Related Rights include, but are not limited to, the following: i. the right to reproduce, adapt, distribute, perform, display, communicate, and translate a Work; ii.
moral rights retained by the original author(s) and/or performer(s); iii. publicity and privacy rights pertaining to a person's image or likeness depicted in a Work; iv. rights protecting against unfair competition in regards to a Work, subject to the limitations in paragraph 4(a), below; v. rights protecting the extraction, dissemination, use and reuse of data in a Work; vi. database rights (such as those arising under Directive 96/9/EC of the European Parliament and of the Council of 11 March 1996 on the legal protection of databases, and under any national implementation thereof, including any amended or successor version of such directive); and vii. other similar, equivalent or corresponding rights throughout the world based on applicable law or treaty, and any national implementations thereof. 2. Waiver. To the greatest extent permitted by, but not in contravention of, applicable law, Affirmer hereby overtly, fully, permanently, irrevocably and unconditionally waives, abandons, and surrenders all of Affirmer's Copyright and Related Rights and associated claims and causes of action, whether now known or unknown (including existing as well as future claims and causes of action), in the Work (i) in all territories worldwide, (ii) for the maximum duration provided by applicable law or treaty (including future time extensions), (iii) in any current or future medium and for any number of copies, and (iv) for any purpose whatsoever, including without limitation commercial, advertising or promotional purposes (the "Waiver"). Affirmer makes the Waiver for the benefit of each member of the public at large and to the detriment of Affirmer's heirs and successors, fully intending that such Waiver shall not be subject to revocation, rescission, cancellation, termination, or any other legal or equitable action to disrupt the quiet enjoyment of the Work by the public as contemplated by Affirmer's express Statement of Purpose. 3. Public License Fallback. 
Should any part of the Waiver for any reason be judged legally invalid or ineffective under applicable law, then the Waiver shall be preserved to the maximum extent permitted taking into account Affirmer's express Statement of Purpose. In addition, to the extent the Waiver is so judged Affirmer hereby grants to each affected person a royalty-free, non transferable, non sublicensable, non exclusive, irrevocable and unconditional license to exercise Affirmer's Copyright and Related Rights in the Work (i) in all territories worldwide, (ii) for the maximum duration provided by applicable law or treaty (including future time extensions), (iii) in any current or future medium and for any number of copies, and (iv) for any purpose whatsoever, including without limitation commercial, advertising or promotional purposes (the "License"). The License shall be deemed effective as of the date CC0 was applied by Affirmer to the Work. Should any part of the License for any reason be judged legally invalid or ineffective under applicable law, such partial invalidity or ineffectiveness shall not invalidate the remainder of the License, and in such case Affirmer hereby affirms that he or she will not (i) exercise any of his or her remaining Copyright and Related Rights in the Work or (ii) assert any associated claims and causes of action with respect to the Work, in either case contrary to Affirmer's express Statement of Purpose. 4. Limitations and Disclaimers. a. No trademark or patent rights held by Affirmer are waived, abandoned, surrendered, licensed or otherwise affected by this document. b. 
Affirmer offers the Work as-is and makes no representations or warranties of any kind concerning the Work, express, implied, statutory or otherwise, including without limitation warranties of title, merchantability, fitness for a particular purpose, non infringement, or the absence of latent or other defects, accuracy, or the present or absence of errors, whether or not discoverable, all to the greatest extent permissible under applicable law.
c. Affirmer disclaims responsibility for clearing rights of other persons that may apply to the Work or any use thereof, including without limitation any person's Copyright and Related Rights in the Work. Further, Affirmer disclaims responsibility for obtaining any necessary consents, permissions or other rights required for any use of the Work.
d. Affirmer understands and acknowledges that Creative Commons is not a party to this document and has no duty or obligation with respect to this CC0 or use of the Work.

For more information, please see

hunspell-bo-0.4.0/Makefile

.PHONY: all test error clean update firefox lo

# Default target: a bare `make` only prints the list of available targets.
error:
	@echo "Please choose one of the following targets: update, test, firefox, lo"

all: update test firefox lo

# Rebuild bo.dic: the first line is the entry count, followed by the syllables.
update:
	@echo "Update bo.dic with tibetan-spellchecker data"
	rm -rf tmp-tibetan-spellchecker-git
	git clone git@github.com:eroux/tibetan-spellchecker.git tmp-tibetan-spellchecker-git
	wc -l tmp-tibetan-spellchecker-git/syllables/*.txt |grep total | cut -d' ' -f2 > bo.dic
	cat tmp-tibetan-spellchecker-git/syllables/*.txt >> bo.dic
	rm -rf tmp-tibetan-spellchecker-git

test:
	@echo "All the following lines should be stars:\n"
	hunspell -d bo -a tests/pass.txt
	@echo "\nNone of the following lines should be stars:\n"
	hunspell -d bo -a tests/fail.txt

firefox:
	$(MAKE) -C firefox

lo:
	$(MAKE) -C lo

clean:
	rm -rf tmp-tibetan-spellchecker-git
	$(MAKE) -C firefox clean
	$(MAKE) -C lo clean
	$(MAKE) -C debian clean
hunspell-bo-0.4.0/README.md

# Classical Tibetan syllable spellchecker for Hunspell

This repository contains the files needed to spell-check Classical Tibetan at the syllable level (not compound words) in [Hunspell](http://hunspell.sourceforge.net/), which is used in [many applications](https://en.wikipedia.org/wiki/Hunspell#Uses). Note that checking compound words for Tibetan is not possible with Hunspell alone, because Tibetan does not separate words.

## Using

#### Global installation

Under Linux or OSX, you can install the spellchecker globally and benefit from it in most applications.

- Under Linux, copy `bo.dic` and `bo.aff` to `/usr/share/hunspell`, or use the `hunspell-bo` package if available in your distribution (under [Debian](https://packages.debian.org/search?keywords=hunspell-bo) for example).
- Under OSX, copy `bo.dic` and `bo.aff` to `/Library/Spelling` and restart your machine.

#### Application-specific installation

- For Firefox, [an extension](https://addons.mozilla.org/fr/firefox/addon/tibetan-spellchecker/) is available.
- For LibreOffice/OpenOffice, see [this extension](http://extensions.openoffice.org/en/project/tibetan-syllable-spell-checker); it is also in the [dictionaries repository](https://cgit.freedesktop.org/libreoffice/dictionaries/).
- For Adobe products (>= CS5.5), see the instructions on [this page](http://blog.napsys.com/2012/11/adding-hyphenation-and-spelling.html).
- For Chrome, see [this feature request](https://bugs.chromium.org/p/chromium/issues/detail?id=662850).

The sources for these extensions are in the [firefox](firefox/) and [lo](lo/) directories. To build them, run `make firefox` or `make lo` from the top-level directory (a bare `make` there only lists the available targets).

## Building / Testing

To rebuild `bo.dic` from the data in [tibetan-spellchecker](https://github.com/eroux/tibetan-spellchecker), run

    make update

For a small test, run

    make test

## Changes

See [CHANGELOG.md](CHANGELOG.md).
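
## How the flags combine

The entries in `bo.dic` carry flags (`A`, `B`, `C`, `N`) that are defined in `bo.aff`. As a rough illustration of what those flags accept, here is a simplified Python model. It is not Hunspell's matching algorithm: the function name `expand` is hypothetical, the `IGNORE` and `REP` directives are left out, and it assumes that the empty `B` suffix flagged `/NS` means "a suffix from class `S` must follow".

```python
# Simplified model of how the bo.aff flags expand one bo.dic entry.
# This is a sketch, not Hunspell's real algorithm.

# Class S (shared by A and B) and class C, copied from bo.aff.
S = ["ག", "གས", "ང", "ངས", "ད", "ན", "བ", "བས", "མ", "མས",
     "འི", "འོ", "འིའོ", "འང", "འམ", "ར", "ལ", "ས"]
C = ["འི", "འོ", "འིའོ", "ར", "ས", "འང", "འམ"]
VOWELS = ["ི", "ུ", "ེ", "ོ"]  # first-level suffixes of A and B


def expand(entry):
    """Return the set of forms accepted for an entry such as 'དཀ/NB'."""
    root, _, flags = entry.partition("/")
    forms = set()
    if "N" not in flags:
        # NEEDAFFIX: only entries without N are valid as a bare root.
        forms.add(root)
    if "C" in flags:
        # Grammatical suffixes only.
        forms.update(root + suffix for suffix in C)
    if "A" in flags or "B" in flags:
        stems = [root + vowel for vowel in VOWELS]
        forms.update(stems)  # root + vowel is valid on its own
        if "B" in flags:
            forms.add(root + "འ")  # ashung, takes no further suffix
        # Assumption: the empty first-level suffix lets the bare root
        # take a class S suffix too, just like the vowelled stems.
        for stem in stems + [root]:
            forms.update(stem + suffix for suffix in S)
    return forms
```

With this model, `expand("དཀ/NB")` contains དཀའ, དཀའི and དཀར but not the bare དཀ, which matches the entries in `tests/pass.txt` and `tests/fail.txt`. An unflagged entry such as མདྲོན expands only to itself.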
## License This work and the derived files are under the [Creative Commons CC0 license](LICENSE). hunspell-bo-0.4.0/bo.aff000066400000000000000000000032521315057543300150310ustar00rootroot00000000000000SET UTF-8 LANGCODE bo # ignoring Ux0F35 and Ux0F37, as they are extremely rare but valid. # This means that a syllable containing these signs at invalid places will not # be detected by the spell checker, but this makes the code more simple at # almost no cost IGNORE ༵༷ NOSPLITSUGS # The "N" makes the suffixes mandatory. It is used only with B, example: དཀ/NB, # in this case, དཀ is invalid, but is not if there is a suffix. NEEDAFFIX N # grammatical suffixes only SFX C Y 7 SFX C 0 འི . +'brel_sgra SFX C 0 འོ . +slar_bsdu SFX C 0 འིའོ . +'brel_sgra_slar_bsdu SFX C 0 ར . +la_don SFX C 0 ས . +byed_sgra SFX C 0 འང . +rgyan_sdud SFX C 0 འམ . +'byed_sdud # Words with mandatory vowel or suffix # If no ashung and no vowel, a suffix is mandatory SFX B Y 6 SFX B 0 ི/S . +i SFX B 0 ུ/S . +u SFX B 0 ེ/S . +e SFX B 0 ོ/S . +o SFX B 0 འ . +ashung SFX B 0 /NS . # Non-mandatory vowel or suffix. In these cases, no ashung is possible, see # the documentation. SFX A Y 5 SFX A 0 ི/S . +i SFX A 0 ུ/S . +u SFX A 0 ེ/S . +e SFX A 0 ོ/S . +o SFX A 0 /S . # List of suffixes used by both A and B SFX S Y 18 SFX S 0 ག . +g SFX S 0 གས . +gs SFX S 0 ང . +ng SFX S 0 ངས . +ngs SFX S 0 ད . +d SFX S 0 ན . +n SFX S 0 བ . +b SFX S 0 བས . +bs SFX S 0 མ . +m SFX S 0 མས . +ms SFX S 0 འི . +'brel_sgra SFX S 0 འོ . +slar_bsdu SFX S 0 འིའོ . +'brel_sgra_slar_bsdu SFX S 0 འང . +rgyan_sdud SFX S 0 འམ . +'byed_sdud SFX S 0 ར . +r SFX S 0 ལ . +l SFX S 0 ས . 
+s # Replace archaic forms by modern ones REP 6 REP འིས ས REP འའིས ས REP འར ར REP ནད ན REP རད ར REP ལད ལ hunspell-bo-0.4.0/bo.dic000066400000000000000000000110351315057543300150320ustar00rootroot00000000000000378 བགླ/C དམེའ མེའ མདྲོན བརྡའ བརྟའ ཏྲ/C ཏྲེས ཐྲིག མྲ/C སྨྲ/C སྨྲོས སྨྲས སྨྲེ/C སྨྲེང སྨྲངས སྨྲང སྣྲོན སྣྲུབས སྣྲེལ རྒྭ/C ཧྥ/A ཀརྨ/C པདྨ/C ཨཱ/C རྒྭ/C བསྭེ/C རྭང རྭི/C དྭང དྭ/C ཏྭོན ཀྭན ཀྭས ཧྭང དབའས ཀྲའུ/C ཀྲུའུ/C ཁྲུའུ/C སྒྱིའུ/C ཅོའུ/C གཅོའུ/C ཐུའུ/C དུའུ/C དྲིའུ/C ནོའུར ཕེའུ/C མུའུ/C མོའུ/C ཚུའུ/C ལོའུ/C ཧུའུ/C ཧེའུ/C ཧྲུའུ/C བྲའོ/C སླེའོ/C ཀའུ/C ཀིའུ/C ཀེའུ/C ཁིའུ/C ཁེའུ/C ཁྱིའུ/C ཁྱེའུ/C ཁྲིའུ/C ཁྲེའུ/C གའུ/C གྲིའུ/C གྲེའུ/C གླེའུ/C འགིའུ/C རྒེའུ/C སྒའུ/C སྒེའུ/C སྒྱེའུ/C སྒྲེའུ/C རྔེའུ/C སྔེའུ/C ཅེའུ/C གཅིའུ/C གཅེའུ/C ལྕེའུ/C རྗེའུ/C ཉེའུ/C སྙེའུ/C ཏེའུ/C གཏེའུ/C རྟའུ/C རྟེའུ/C སྟེའུ/C ཐའུ/C ཐིའུ/C ཐེའུ/C ཐོའུ/C མཐེའུ/C དེའུ/C དྲེའུ/C མདེའུ/C རྡེའུ/C ལྡེའུ/C སྡེའུ/C ནའུ/C ནེའུ/C སྣེའུ/C དཔེའུ/C སྤའུ/C སྤེའུ/C སྤྱིའུ/C སྤྲེའུ/C ཕྲའུ/C ཕྲེའུ/C འཕེའུ/C བེའུ/C བྱའུ/C བྱིའུ/C བྱེའུ/C བྲའུ/C བྲེའུ/C བྲོའུ/C འབེའུ/C སྦྲེའུ/C མིའུ/C མྱིའུ/C རྨེའུ/C སྨེའུ/C ཙིའུ/C ཙེའུ/C གཙེའུ/C རྩིའུ/C རྩེའུ/C ཚའུ/C ཚེའུ/C མཚེའུ/C མཚེའུ/C རྫིའུ/C རྫེའུ/C གཞུའུ/C ཟེའུ/C ཡེའུ/C གཡིའུ/C རེའུ/C ལའུ/C ལིའུ/C ལེའུ/C ཤའུ/C ཤེའུ/C སིའུ/C སེའུ/C སྲིའུ/C སླེའུ/C བསེའུ/C ཨའུ/C ཀ/A ཀྱ/A ཀྲ/A ཀླ/A དཀ/NB དཀྱ/A དཀྲ/A བཀ/NB བཀྱ/A བཀྲ/A བཀླ/A རྐ/A རྐྱ/A ལྐ/A སྐ/A སྐྱ/A སྐྲ/A བརྐ/A བརྐྱ/A བསྐ/A བསྐྱ/A བསྐྲ/A ཁ/A ཁྱ/A ཁྲ/A མཁ/NB མཁྱ/A མཁྲ/A འཁ/NB འཁྱ/A འཁྲ/A ག/A གྱ/A གྲ/A གླ/A དག/NB དགྱ/A དགྲ/A བག/NB བགྱ/A བགྲ/A མག/NB མགྱ/A མགྲ/A འག/NB འགྱ/A འགྲ/A རྒ/A རྒྱ/A ལྒ/A སྒ/A སྒྱ/A སྒྲ/A བརྒ/A བརྒྱ/A བསྒ/A བསྒྱ/A བསྒྲ/A ང/A དང/NB མང/NB རྔ/A ལྔ/A སྔ/A བརྔ/A བསྔ/A ཅ/A གཅ/NB བཅ/NB ལྕ/A ཆ/A མཆ/NB འཆ/NB ཇ/A མཇ/NB འཇ/NB རྗ/A ལྗ/A བརྗ/A ཉ/A གཉ/NB མཉ/NB རྙ/A སྙ/A བརྙ/A བསྙ/A ཏ/A གཏ/NB བཏ/NB རྟ/A ལྟ/A སྟ/A བརྟ/A བལྟ/A བསྟ/A ཐ/A མཐ/NB འཐ/NB ད/A དྲ/A གད/NB བད/NB མད/NB འད/NB འདྲ/A རྡ/A ལྡ/A སྡ/A བརྡ/A བལྡ/A བསྡ/A ན/A གན/NB མན/NB རྣ/A སྣ/A བརྣ/A བསྣ/A པ/A པྱ/A པྲ/A དཔ/NB དཔྱ/A དཔྲ/A ལྤ/A སྤ/A སྤྱ/A སྤྲ/A ཕ/A ཕྱ/A ཕྲ/A 
འཕ/NB འཕྱ/A འཕྲ/A བ/A བྱ/A བྲ/A བླ/A དབ/NB དབྱ/A དབྲ/A འབ/NB འབྱ/A འབྲ/A རྦ/A ལྦ/A སྦ/A སྦྱ/A སྦྲ/A མ/A མྱ/A དམ/NB དམྱ/A རྨ/A རྨྱ/A སྨ/A སྨྱ/A ཙ/A གཙ/NB བཙ/NB རྩ/A སྩ/A བརྩ/A བསྩ/A ཚ/A མཚ/NB འཚ/NB ཛ/A མཛ/NB འཛ/NB རྫ/A བརྫ/A ཝ/A ཞ/A གཞ/NB བཞ/NB ཟ/A ཟླ/A གཟ/NB བཟ/NB བཟླ/A འ/A ཡ/A གཡ/NB ར/A རླ/A བརླ/A ལ/A ཤ/A གཤ/NB བཤ/NB ས/A སྲ/A སླ/A གས/NB བས/NB བསྲ/A བསླ/A ཧ/A ཧྲ/A ལྷ/A ཨ/A ཀྭ/C ཀྭའི/C ཁྭ/C གྭ/C གྲྭ/C ཉྭ/C དྭོ/C དྭངས དྭགས དྲྭ/C ཕྱྭ/C རྩྭ/C ཚྭ/C ཚྭབ ཞྭ/C ཟྭ/C རྭ/C ལྭ/C ཤྭ/C སྭོ/C བསྭ/C བསྭོ/C ཧྭ/C ཧྭག ཧྭགས hunspell-bo-0.4.0/firefox/000077500000000000000000000000001315057543300154135ustar00rootroot00000000000000hunspell-bo-0.4.0/firefox/Makefile000066400000000000000000000004211315057543300170500ustar00rootroot00000000000000all: xpi xpi: install.rdf ../bo.dic ../bo.aff @echo "Making xpi file for firefox..." @mkdir -p dictionaries && cp ../bo.dic ../bo.aff dictionaries @zip -rq tibetan-spellchecker.xpi install.rdf dictionaries clean: @rm -rf dictionaries README tibetan-spellchecker.xpi hunspell-bo-0.4.0/firefox/README.md000066400000000000000000000003321315057543300166700ustar00rootroot00000000000000# Tibetan spellchecker firefox extension These are the files to build the firefox extension. See [documentation](https://developer.mozilla.org/en-US/Add-ons/Extension_Packaging) for format. To build it, run make hunspell-bo-0.4.0/firefox/install.rdf000066400000000000000000000035111315057543300175560ustar00rootroot00000000000000 bo@dictionaries.addons.mozilla.org true 0.4.0 64 Elie Roux https://github.com/eroux/hunspell-bo toolkit@mozilla.org 19.0 39.0 {ec8030f7-c20a-464f-9b0e-13a3a9e97384} 19.0 56.0 {3550f703-e582-4d05-9a08-453d09bdfdc6} 10.0 31.0 {aa3c5121-dab2-40e2-81ca-7ea25febc110} 19.0 39.0 Tibetan Spellcheck Tibetan syllable spellchecker. 
hunspell-bo-0.4.0/lo/

hunspell-bo-0.4.0/lo/META-INF/

hunspell-bo-0.4.0/lo/META-INF/manifest.xml

hunspell-bo-0.4.0/lo/Makefile

all: oxt

# https://wiki.openoffice.org/wiki/Documentation/DevGuide/Extensions/File_Format
FILES=dictionaries.xcu description.xml package-description.txt

# CHANGELOG.md is zipped and cleaned below, so it is copied alongside README.md.
oxt: $(FILES) ../README.md ../CHANGELOG.md ../bo.dic ../bo.aff META-INF/manifest.xml
	@echo "Making oxt file for LibreOffice..."
	@mkdir -p dictionaries && cp ../bo.dic ../bo.aff dictionaries
	@cp ../README.md ../CHANGELOG.md .
	@zip -rq tibetan-spellchecker.oxt dictionaries $(FILES) META-INF README.md CHANGELOG.md

clean:
	@rm -rf dictionaries README.md CHANGELOG.md tibetan-spellchecker.oxt

hunspell-bo-0.4.0/lo/description.xml

Correcteur orthographique tibétain Tibetan spellchecker དག་ཆ་ཞིབ་བཤེར་བྱ་རིམ། Elie Roux

hunspell-bo-0.4.0/lo/dictionaries.xcu

%origin%/dictionaries/bo.aff %origin%/dictionaries/bo.dic DICT_SPELL bo-CN bo-IN bo

hunspell-bo-0.4.0/lo/package-description.txt

This extension provides Classical Tibetan spellchecking at syllable level.
You can file bug reports at https://github.com/eroux/hunspell-bo/issues

hunspell-bo-0.4.0/tests/

hunspell-bo-0.4.0/tests/fail.txt

མདྲོམ བདྭག བདྲའ དཀ ཁནད པའིས

hunspell-bo-0.4.0/tests/pass.txt

མདྲོན དཀའ དཀའི དཀར བ བོ བར བོར བ༷ བ༵ བོ༷