Commit Graph

290 Commits

Author SHA1 Message Date
Amir Aharoni
7a0d559dc0 Add the Boyo Gagrai for the Ho language in the Warang Citi script
Bug: T233301
Change-Id: Iad328485531b97428f795b154f8181e71947a5fd
2020-11-30 15:31:31 +00:00
Niklas Laxström
8d5d63b996 Regular update of language name search database
Change-Id: I7325d18a2da7f3a84f3d43efcc84029586319acd
2020-10-20 08:17:48 +00:00
Niklas Laxström
d3b07d2ef0 Update language name search database
Change-Id: I2aaa0f975fc80ea42d88092f3bebfd505a48d253
2020-09-17 08:33:24 +00:00
Reedy
99937ba18b Add GochiHand font
Bug: T258839
Change-Id: I38eacbfbc3293c7b7a2a6bf4f680b2f70f1a43ad
2020-09-16 05:20:07 +00:00
Reedy
c7caa09b0b Remove comment about eot file format
No eot fonts in the repo, and I presume we don't support those versions of IE anymore either...

Change-Id: I6a2d45592933cc9d1dfa0d0d0a0618a32aaee693
2020-09-15 13:09:11 +01:00
David Kamholz
dc4937eb1f update Bali fonts to fix spacing issue
Change-Id: I221e9c2db3373ffc14a9f764e5de311e980a93a6
2020-07-27 14:29:15 -07:00
Umherirrender
44175839e5 Handle invalid value from LanguageNameIndexer::getLanguageData
Check preg_match return for failure and also check json_decode for
failure by checking for the index key needed for further processing.

Change-Id: Ifecf3fdaf476a05d0dc6fc1c40c3d8a2aa0fa88b
2020-06-02 20:28:25 +00:00
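The two checks named in this commit (a failed pattern match, and a decode that does not yield the key needed later) can be illustrated with a minimal sketch. It is written in Python rather than the extension's PHP, and the function name `parse_language_data`, the pattern, and the key name `"languages"` are assumptions made for the example, not taken from the repository.

```python
import json
import re

# Hypothetical pattern: pull a JSON object out of surrounding text.
JSON_OBJECT = re.compile(r"\{.*\}", re.DOTALL)


def parse_language_data(raw, required_key="languages"):
    """Return the decoded data, or None if any step fails."""
    match = JSON_OBJECT.search(raw)
    if match is None:
        # No match: analogous to checking the preg_match return value.
        return None
    try:
        data = json.loads(match.group(0))
    except json.JSONDecodeError:
        # The text looked like JSON but could not be decoded.
        return None
    # Decoding can succeed and still lack the key needed for further processing.
    if not isinstance(data, dict) or required_key not in data:
        return None
    return data
```

A caller would treat `None` as "no usable data" and skip further processing instead of indexing into an invalid value.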
Niklas Laxström
6f11324b98 Update language name search database
Change-Id: Idec931027ae52e8f93dd989760466157d4880c22
2020-05-26 13:39:25 +02:00
David Kamholz
a1628b0af0 Balinese fonts: update Pustaka Bali and add new Kadiri font
Change-Id: Ica823ce359bcdacc4982faf6c19f37549be37088
2020-04-20 04:27:49 +00:00
David Kamholz
44ea2ceeb3 added Pustaka Bali and Vimala fonts for Balinese
Bug: T247142
Change-Id: Ie77a72ca532272662dd2c3f7f4d5a3ea2236f067
2020-03-09 05:47:15 +00:00
Niklas Laxström
81e7f9a888 Update language name search index with CLDR 36 data
Change-Id: I2a9ff49eb64917a4e11938e654b8d4d387f9a7c8
2019-11-26 10:58:19 +00:00
Niklas Laxström
d4786e5797 Update language name search index
Change-Id: I81fd17aa8d66a77b077f436c308702563b2b6693
2019-09-04 14:49:28 +00:00
Niklas Laxström
379f4e940a Update language name search index
Change-Id: I621dcbe7ec2b60543d6842834c2d8419c4512875
2019-08-26 07:15:24 +00:00
Niklas Laxström
cd5f6724c7 Update language name search index using CLDR 35.1
Change-Id: Iced51611124c59d29f2d5cd7f62cf6941af88d51
2019-05-27 10:14:21 +02:00
Niklas Laxström
6939354e16 Update language name search index
I noticed some language names are not searchable. I made it so
that autonyms from language-data are added to the search index.
Without this, languages not present in Names.php or in the CLDR
extension are not searchable via the API except by language code.

Change-Id: I51a9e2eb15fb40963e6edbf1db76133d84de7291
2019-05-21 17:21:21 +02:00
Santhosh Thottingal
da0aad3e03 Add fonts for Church Slavonic
Adding the Ponomar, Fedorovs, Menaion and Monomakh fonts.

Bug: T156843
Change-Id: I8f7c4861fc667bba3dfd4359f2c59633fa6f83ff
2019-04-01 12:39:33 -07:00
Umherirrender
fbf3f3b214 build: Updating mediawiki/mediawiki-codesniffer to 24.0.0
MLEB extensions now supporting php7

Change-Id: I582418eb4038114d0e751797bc17929ca4633867
2019-03-11 22:36:40 +01:00
Umherirrender
03bc19fab2 Use ::class for class name resolution
Available since php5.5

Change-Id: Ie71b7f9d289a34532f6e3e1f32ac25bf08b9903f
2019-03-02 20:11:12 +01:00
Thiemo Kreuz
f2e2e640c6 Avoid counting where not necessary
Change-Id: Iaae939780df26066de40e1584492865cb0ac80a7
2018-08-10 14:42:41 +00:00
Santhosh Thottingal
594aca60da Update Padauk font to version 3.003
SIL has released a new version of this font:
https://software.sil.org/padauk/download/

Update the webfonts using the released TTF

Change-Id: I89d7db0fecae9cf290099d459dcc1f847113e329
2018-07-17 14:34:30 +05:30
libraryupgrader
38b449ceef build: Updating mediawiki/mediawiki-codesniffer to 20.0.0
Change-Id: I02db015a34f67a8b268feead090d2f6be5658935
2018-05-26 07:31:25 +00:00
Kunal Mehta
10a5865f9a Fix MediaWiki.Commenting.LicenseComment.InvalidLicenseTag errors
Change-Id: I2c868006d108b35adaa19d179bc6ebe95e29d0ef
2018-05-23 23:02:54 -07:00
Kartik Mistry
d00ddb89fd Documentation: Update sfntly URL and formatting
Change-Id: I1c821d7e20c7428f34495aed32f6cc091fd0109b
2018-05-07 12:29:12 +00:00
gopavasanth
5f153f0078 Change http://www.mediawiki.org/ to https://www.mediawiki.org/
In other files, links to mediawiki site are https://www.mediawiki.org/

Bug: T189687
Change-Id: Ie7771e8e97265da1c23395c579ee3f505f8fbb83
2018-04-24 15:55:14 +00:00
libraryupgrader
f97802e4e2 build: Updating mediawiki/mediawiki-codesniffer to 18.0.0
The following sniffs are failing and were disabled:
* MediaWiki.VariableAnalysis.UnusedGlobalVariables.UnusedGlobal$wgWBClientSettings

Change-Id: Ia5423c3d7ea419b3f073f35736de7a9379d4429a
2018-04-14 07:39:31 +00:00
libraryupgrader
858ebd5552 build: Updating mediawiki/mediawiki-codesniffer to 17.0.0
The following sniffs are failing and were disabled:
* MediaWiki.Commenting.LicenseComment.InvalidLicenseTag

The following sniffs now pass and were enabled:
* MediaWiki.Commenting.FunctionComment.MissingParamComment

Change-Id: I06e0542d737cec5e2500aad6d85f72951f8b584d
2018-03-29 06:53:52 +00:00
Kunal Mehta
67d470a9e7 Use SPDX 3.0 license identifier
SPDX released version 3 of their license list (<https://spdx.org/licenses/>),
which changed the FSF licenses to explicitly end in -only or -or-later
instead of relying on an easy to miss + symbol.

Bug: T183858
Change-Id: Id75a37eab27282ca7797fa3e6ceb92750c59477f
2018-03-03 19:27:21 +00:00
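The renaming described in this commit can be sketched as a small conversion helper. This is an illustration in Python, not code from the repository; the function name `to_spdx3` and the decision to handle only the GPL, LGPL, AGPL and GFDL families are assumptions for the example.

```python
import re

# Old-style FSF identifiers: a family, a version, and an optional "+".
OLD_FSF_ID = re.compile(r"^(AGPL|GFDL|GPL|LGPL)-(\d+\.\d+)(\+?)$")


def to_spdx3(identifier):
    """Convert an old FSF license identifier to its explicit form."""
    match = OLD_FSF_ID.match(identifier)
    if match is None:
        # Not an old-style FSF identifier; leave it unchanged.
        return identifier
    family, version, plus = match.groups()
    # A trailing "+" meant "or any later version"; no "+" meant "only".
    suffix = "or-later" if plus else "only"
    return f"{family}-{version}-{suffix}"
```

For instance, `to_spdx3("GPL-2.0+")` gives `"GPL-2.0-or-later"`, while identifiers that already use the explicit suffix pass through unchanged.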
Niklas Laxström
1e15341fd1 Use dash as separator for non-prefix matches in language name search
Bug: T186480
Change-Id: Ib785e2b070e0c5a218b236be194417f0b1fbd102
2018-02-06 17:26:21 +01:00
Vagrant Default User
91a54767b6 Add aliases for Georgian, Armenian, Spanish, and Japanese
Also make it possible to add multiple aliases for a language.

Bug: T178996
Change-Id: I00bb4a158caed0c1ba15d41e294281a001c917b1
2018-01-18 14:45:26 +02:00
Kartik Mistry
d969c2a804 Remove non-default Saab font for Punjabi
* Not updated from upstream; last release is buggy as per upstream.
* Punjabi language has good font support in all modern operating
  systems.

Bug: T180422
Change-Id: I323953b4c6293df89034f05e0c2e13a96b42dc34
2017-12-05 06:19:13 +00:00
jenkins-bot
603cfea7d0 Merge "Improve ULS language search api"
2017-12-01 04:47:30 +00:00
Santhosh Thottingal
3bf7361262 LanguageNameSearch: Optimize levenshteinDistance
1. Do string comparison for equality early in the method so that we can
   do early return if it passes.
2. Move the zero length check for string up for early return. This may
   not have any significant change in performance though.

Change-Id: I86bdd612a4a31c5ebfac6bcd7687b829acc69cda
2017-11-30 16:38:41 +05:30
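The two early returns listed in this commit can be sketched as follows. This is a generic two-row Levenshtein implementation in Python, given as an illustration under that assumption; it is not the extension's PHP method, and the name `levenshtein_distance` is chosen for the example.

```python
def levenshtein_distance(a, b):
    """Edit distance between two strings, with early returns."""
    # Early return 1: identical strings need no edits.
    if a == b:
        return 0
    # Early return 2: if one string is empty, the distance is the
    # length of the other.
    if not a:
        return len(b)
    if not b:
        return len(a)

    # Standard dynamic programming, keeping only two rows in memory.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution
            ))
        previous = current
    return previous[-1]
```

The equality check is cheap compared with filling the table, so exact matches, which are common in a name search, return without any further work.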
Niklas Laxström
e87dd20cdd Improve ULS language search api
* Store prefixes and infixes separately in the data
* First match language code, then prefixes, then infixes
* Try to use suggestion either in user language or autonym first
* use formatversion=2 to avoid escaping Unicode

Using Language::fetchLanguageName might have a small
performance impact. On the other hand, there is now a check
to skip languages we already found, avoiding some fuzzy
matching.

This is in preparation for a change in jquery.uls to use
the search API more, while trying to reduce the amount of
weird autocompletion suggestions we show to the user.

Bug: T73891
Change-Id: Id94c5352d9a591969bf90144d1d2d5e758d08301
2017-11-27 14:57:42 +01:00
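The matching order described in this commit (language code first, then prefixes, then infixes, skipping languages already found) can be sketched as below. This is a simplified Python illustration, not the extension's implementation: the function name `search_languages`, the shape of the two indexes (search string mapped to language code), and the use of a plain prefix test on both indexes are assumptions, and the fuzzy matching step is left out.

```python
def search_languages(query, codes, prefixes, infixes):
    """Return {language code: matched string} in priority order.

    codes    -- set of known language codes
    prefixes -- dict mapping the start of a language name to its code
    infixes  -- dict mapping a later word of a language name to its code
    """
    query = query.lower()
    found = {}  # dicts keep insertion order, so earlier matches rank first

    # 1. An exact language code match has the highest priority.
    if query in codes:
        found[query] = query

    # 2. Prefix matches, then 3. infix matches.
    for index in (prefixes, infixes):
        for key, code in index.items():
            # Skip languages that an earlier, better step already found.
            if code not in found and key.startswith(query):
                found[code] = key
    return found
```

With `prefixes = {"finnish": "fi", "french": "fr"}` and `infixes = {"frisian": "fy"}`, the query `"f"` returns Finnish and French before Frisian, because prefix matches are collected first.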
Niklas Laxström
a353c5ab65 Perform search on every word of language name
See e.g. T132021. This favours coverage over quality.

Change-Id: I3fc8fb1702802bc002c3d7e2941563840914f325
2017-11-23 09:14:10 +00:00
Santhosh Thottingal
dc84413373 Remove Madan font for ne
* Unknown upstream
* Not updated for years
* ne has better support in operating systems
* Non-default font for ne

Bug: T180422

Change-Id: Ife0b81e4db3bc069752d89c53f4690ddcfad7ef3
2017-11-14 15:49:35 +01:00
Santhosh Thottingal
d5f0666025 Remove non-default Gubby font for Kannada and Tulu
Bug: T180422
Change-Id: I78af0a3889e48625ebb38b1b212cb8b454a5639c
2017-11-14 15:23:09 +01:00
Santhosh Thottingal
146426ffe7 Remove fonts for Odia(or) from fontrepo
Remove the non-default Utkal font; its upstream is now unknown.

Change-Id: Iefa9eeaf953d87d4a5c8766fa575d61f9bd96d2b
2017-11-13 15:31:27 +05:30
Santhosh Thottingal
c0bbd9efc1 Remove Tamil fonts from fontrepo
* These fonts no longer have an upstream and are little known among
  Tamil users.
* Tamil is very well supported in all operating systems
  nowadays.
* Also reduce the metadata size for fontrepo

Change-Id: I4e7afb6476a4714f8d87bd2a048309b732883b2f
2017-11-13 15:26:59 +05:30
Santhosh Thottingal
28c0ba6bca Remove fonts for Malayalam from font repo
* I maintain these fonts upstream, and there have been several releases
  since these fonts were added.
* Malayalam has better support in operating systems compared to 2012
  when these fonts were added.
* Reduce font metadata size for wikipedia pages when webfonts are
  enabled.

Change-Id: Ie5b54cc866b1c67849b094a9701b2c80d876b55f
2017-11-13 09:16:58 +00:00
Santhosh Thottingal
6bddc79773 Remove Lohit family of fonts from fontrepo
* The languages covered by these fonts are now supported in all
  operating systems.
* These fonts have not been updated for years in our repo
* Reduces the amount of font repo data we deliver for *every wikipedia
  page* when webfonts is enabled

Change-Id: Ia0f1b6acc4cf8b7a354671bea47b58425ab8c08e
2017-11-13 09:16:22 +00:00
Santhosh Thottingal
4f3461a9aa Remove autonym font and its usage
I no longer maintain the Autonym font.
Also remove the tofu detection.

Bug: T135464
Bug: T135465
Change-Id: I103aab40ea5f5fc403a7ee5b23d1b634cc9c6ee1
2017-11-13 08:03:12 +00:00
Niklas Laxström
56d3f2af43 Make output of LanguageNameIndexer more consistent
Change-Id: I13f06b9b1c65068206f1728f8a427c4ca46f28ec
2017-10-31 16:25:01 +01:00
Amire80
101532cfa6 Add special language names to facilitate searching
This adds several custom languages.

The addition of Punjabi addresses Bug T178070.

The addition of Chinese addresses Bug T73891.

Georgian and Catalan (Valencian) variant spellings
are added because these are the most frequent languages
that are not found in the ULS search box.

Bug: T73891
Bug: T178070
Change-Id: Ifbb08b560e454643d246379c19f725bde61917e9
2017-10-25 13:50:12 +05:30
Santhosh Thottingal
18c09bc6d3 Update language name data index with CLDR 31
Change-Id: I7c7b26a01b5c5780cbf7a19983388e16b4e97cc1
2017-10-24 17:52:29 +05:30
Umherirrender
7761a9e60b Improve some parameter docs
Change-Id: Icd8fd55cf1a4a83a6f674038e098b9be8257dc0c
2017-10-07 16:54:28 +02:00
Umherirrender
1a4ac5a6d6 build: Updating mediawiki/mediawiki-codesniffer to 0.10.1
Change-Id: Ib7a361cf2973bf0bba0fb8944762216f44c226a8
2017-07-26 23:22:26 +02:00
Kartik Mistry
eb8eed98e9 Add Sundanese font
Bug: T162221
Change-Id: Iabf1a22838bd4375be9c8ed3aabad9205523ef8e
2017-04-19 08:42:19 +00:00
Niklas Laxström
55b68c329d LanguageNameSearch: do not mix different scripts in same buckets
To keep the average and maximum bucket size low, I made codepoints
< 4000 more granular and code points >= 4000 less granular. This
could be tweaked further for sure to reach more even sized buckets.

Bucket stats before:
 - 773 buckets
 - smallest has 1 entries
 - largest has 1804 entries
 - median size is 66 entries
 - average size is 45.394566623545 entries

Bucket stats after:
 - 698 buckets
 - smallest has 1 entries
 - largest has 1792 entries
 - median size is 16 entries
 - average size is 50.272206303725 entries

Change-Id: Id62d93658117564b05294c2fe36ca7c182784859
2016-08-08 16:21:52 +02:00
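The figures quoted in this commit (bucket count, smallest, largest, median and average size) can be computed with a short helper. This is a Python sketch assuming the index is a mapping from bucket key to a list of entries; the function name `bucket_stats` and that data shape are assumptions for the example, not the format LanguageNameIndexer uses.

```python
from statistics import mean, median


def bucket_stats(buckets):
    """Summarize bucket sizes for a mapping of bucket key -> entries.

    Assumes at least one bucket; min() and max() raise on empty input.
    """
    sizes = [len(entries) for entries in buckets.values()]
    return {
        "buckets": len(sizes),
        "smallest": min(sizes),
        "largest": max(sizes),
        "median": median(sizes),
        "average": mean(sizes),
    }
```

Comparing these numbers before and after a change to the bucketing rule shows whether the distribution became more even, which is how the commit reports its effect.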
Niklas Laxström
f73f9a8b5d LanguageNameIndexer: print bucket stats
Change-Id: If50b65b1bbda010f0dbde7d344edcb5bdcd382df
2016-08-08 13:38:53 +00:00
Niklas Laxström
bc7ee1ed19 LanguageNameIndexer: sort buckets
Change-Id: Ib33bc432d5f61de2fbb6e83f3566baebb184c441
2016-08-08 13:18:30 +00:00