I'm currently working on an update to Googlebar Lite, and I'm debating whether or not to drop some of the more outdated locales. I wrote a Perl script to gather some data about the locales, and this is what it reports as of my current development snapshot:
Locale   Matches   % Match   Missing   % Missing
------------------------------------------------
en-US        172   {Master Locale}
ca-AD         24     14.0%         6        3.5%
cs-CZ         12      7.0%         0        0.0%
da             7      4.1%         0        0.0%
de             8      4.7%         0        0.0%
el-GR         32     18.6%         6        3.5%
es-ES          4      2.3%         0        0.0%
et-EE         14      8.1%         6        3.5%
fr            13      7.6%         6        3.5%
hr-HR         31     18.0%         6        3.5%
it            23     13.4%         6        3.5%
ja-JP          9      5.2%         6        3.5%
nl             8      4.7%         0        0.0%
pl             6      3.5%         6        3.5%
pt-BR         31     18.0%         6        3.5%
ru-RU         22     12.8%         6        3.5%
sk-SK         32     18.6%         6        3.5%
tr            30     17.4%         6        3.5%
uk-UA          5      2.9%         0        0.0%
zh-CN         30     17.4%         6        3.5%
zh-TW         11      6.4%         6        3.5%
The four locales most out of date are el-GR and sk-SK (22.1% each) and hr-HR and pt-BR (21.5% each). Those are the ones I'm currently considering tossing out, but I could also make a case for tr and zh-CN, which sit at 20.9%. My non-scientific rule of thumb has been to toss out any locale that gets to be more than 20% out of date, meaning that the strings that either match the en-US master or are missing entirely add up to more than 20% of the total. I'll give myself a few more days to think about it. Do you have any ideas as to what I should do? If so, feel free to leave a comment below.
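For anyone who wants to play with the cutoff, here is a rough sketch of that rule of thumb. It is written in Python rather than Perl, it is not the script that produced the report, and the names (`REPORT`, `out_of_date_percent`, `locales_over_threshold`) are made up for illustration. The counts are copied from the table above.

```python
# Total number of strings in the en-US master locale, taken from the report.
MASTER_TOTAL = 172

# (locale, matching strings, missing strings), copied from the report.
REPORT = [
    ("ca-AD", 24, 6), ("cs-CZ", 12, 0), ("da", 7, 0), ("de", 8, 0),
    ("el-GR", 32, 6), ("es-ES", 4, 0), ("et-EE", 14, 6), ("fr", 13, 6),
    ("hr-HR", 31, 6), ("it", 23, 6), ("ja-JP", 9, 6), ("nl", 8, 0),
    ("pl", 6, 6), ("pt-BR", 31, 6), ("ru-RU", 22, 6), ("sk-SK", 32, 6),
    ("tr", 30, 6), ("uk-UA", 5, 0), ("zh-CN", 30, 6), ("zh-TW", 11, 6),
]


def out_of_date_percent(matches, missing, total=MASTER_TOTAL):
    """Share of strings that either match the master locale or are missing."""
    return 100.0 * (matches + missing) / total


def locales_over_threshold(report, threshold=20.0):
    """Return (locale, percent) pairs above the threshold, worst first."""
    flagged = [
        (locale, out_of_date_percent(matches, missing))
        for locale, matches, missing in report
    ]
    flagged = [(locale, pct) for locale, pct in flagged if pct > threshold]
    # Sort by descending percentage, then by name so ties come out in a stable order.
    return sorted(flagged, key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    for locale, pct in locales_over_threshold(REPORT):
        print(f"{locale:<6} {pct:5.1f}% out of date")
```

Passing a different `threshold` (say 15.0 or 25.0) shows how sensitive the list of candidates is to where the line gets drawn.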