babel/old/mirror: scripts/import

annotate scripts/import_cldr.py @ 29:cbda87af9aa0

Changing `write_po` to include licensing info, same as the project. Since the method is also to create the initial pot file, we also include the year in the copyright. We also mark the translations catalog template as fuzzy as it should be, only localized translations catalogs ready to be compiled should not include the fuzzy bit on the header.

author	palgarvio
date	Sun, 03 Jun 2007 15:30:07 +0000
parents	695884591af6
children	9a00ac84004c

rev	line source
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	1 #!/usr/bin/env python
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	2 # -- coding: utf-8 --
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	3 #
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	4 # Copyright (C) 2007 Edgewall Software
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	5 # All rights reserved.
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	6 #
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	7 # This software is licensed as described in the file COPYING, which
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	8 # you should have received as part of this distribution. The terms
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	9 # are also available at http://babel.edgewall.org/wiki/License.
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	10 #
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	11 # This software consists of voluntary contributions made by many
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	12 # individuals. For the exact contribution history, see the revision
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	13 # history and logs, available at http://babel.edgewall.org/log/.
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	14
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	15 import copy
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	16 from optparse import OptionParser
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	17 import os
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	18 import pickle
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	19 import sys
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	20 try:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	21 from xml.etree.ElementTree import parse
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	22 except ImportError:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	23 from elementtree.ElementTree import parse
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	24
11 11f64b232b04 Add basic support for number format patterns. jonas parents: 10 diff changeset	25 from babel import dates, numbers
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	26
17 aa33ad077d24 Minor date formatting improvements. cmlenz parents: 15 diff changeset	27 weekdays = {'mon': 0, 'tue': 1, 'wed': 2, 'thu': 3, 'fri': 4, 'sat': 5,
aa33ad077d24 Minor date formatting improvements. cmlenz parents: 15 diff changeset	28 'sun': 6}
10 0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	29
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	30 try:
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	31 any
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	32 except NameError:
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	33 def any(iterable):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	34 return filter(None, list(iterable))
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	35
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	36 def _text(elem):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	37 buf = [elem.text or '']
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	38 for child in elem:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	39 buf.append(_text(child))
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	40 buf.append(elem.tail or '')
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	41 return u''.join(filter(None, buf)).strip()
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	42
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	43 def main():
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	44 parser = OptionParser(usage='%prog path/to/cldr')
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	45 options, args = parser.parse_args()
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	46 if len(args) != 1:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	47 parser.error('incorrect number of arguments')
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	48
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	49 srcdir = args[0]
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	50 destdir = os.path.join(os.path.dirname(os.path.abspath(sys.argv[0])),
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	51 '..', 'babel', 'localedata')
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	52
10 0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	53 sup = parse(os.path.join(srcdir, 'supplemental', 'supplementalData.xml'))
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	54
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	55 # build a territory containment mapping for inheritance
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	56 regions = {}
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	57 for elem in sup.findall('//territoryContainment/group'):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	58 regions[elem.attrib['type']] = elem.attrib['contains'].split()
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	59
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	60 # Resolve territory containment
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	61 territory_containment = {}
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	62 region_items = regions.items()
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	63 region_items.sort()
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	64 for group, territory_list in region_items:
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	65 for territory in territory_list:
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	66 containers = territory_containment.setdefault(territory, set([]))
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	67 if group in territory_containment:
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	68 containers \|= territory_containment[group]
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	69 containers.add(group)
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	70
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	71 filenames = os.listdir(os.path.join(srcdir, 'main'))
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	72 filenames.remove('root.xml')
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	73 filenames.sort(lambda a,b: len(a)-len(b))
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	74 filenames.insert(0, 'root.xml')
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	75
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	76 dicts = {}
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	77
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	78 for filename in filenames:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	79 print>>sys.stderr, 'Processing input file %r' % filename
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	80 stem, ext = os.path.splitext(filename)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	81 if ext != '.xml':
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	82 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	83
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	84 tree = parse(os.path.join(srcdir, 'main', filename))
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	85 data = {}
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	86
10 0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	87 language = None
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	88 elem = tree.find('//identity/language')
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	89 if elem is not None:
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	90 language = elem.attrib['type']
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	91 print>>sys.stderr, ' Language: %r' % language
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	92
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	93 territory = None
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	94 elem = tree.find('//identity/territory')
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	95 if elem is not None:
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	96 territory = elem.attrib['type']
15 b47c34d42eda Extended and documented `LazyProxy`. cmlenz parents: 11 diff changeset	97 else:
b47c34d42eda Extended and documented `LazyProxy`. cmlenz parents: 11 diff changeset	98 territory = '001' # world
10 0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	99 print>>sys.stderr, ' Territory: %r' % territory
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	100 regions = territory_containment.get(territory, [])
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	101 print>>sys.stderr, ' Regions: %r' % regions
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	102
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	103 # <localeDisplayNames>
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	104
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	105 territories = data.setdefault('territories', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	106 for elem in tree.findall('//territories/territory'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	107 if 'draft' in elem.attrib and elem.attrib['type'] in territories:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	108 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	109 territories[elem.attrib['type']] = _text(elem)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	110
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	111 languages = data.setdefault('languages', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	112 for elem in tree.findall('//languages/language'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	113 if 'draft' in elem.attrib and elem.attrib['type'] in languages:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	114 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	115 languages[elem.attrib['type']] = _text(elem)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	116
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	117 variants = data.setdefault('variants', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	118 for elem in tree.findall('//variants/variant'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	119 if 'draft' in elem.attrib and elem.attrib['type'] in variants:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	120 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	121 variants[elem.attrib['type']] = _text(elem)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	122
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	123 scripts = data.setdefault('scripts', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	124 for elem in tree.findall('//scripts/script'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	125 if 'draft' in elem.attrib and elem.attrib['type'] in scripts:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	126 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	127 scripts[elem.attrib['type']] = _text(elem)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	128
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	129 # <dates>
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	130
10 0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	131 week_data = data.setdefault('week_data', {})
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	132 supelem = sup.find('//weekData')
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	133
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	134 for elem in supelem.findall('minDays'):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	135 territories = elem.attrib['territories'].split()
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	136 if territory in territories or any([r in territories for r in regions]):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	137 week_data['min_days'] = int(elem.attrib['count'])
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	138
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	139 for elem in supelem.findall('firstDay'):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	140 territories = elem.attrib['territories'].split()
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	141 if territory in territories or any([r in territories for r in regions]):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	142 week_data['first_day'] = weekdays[elem.attrib['day']]
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	143
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	144 for elem in supelem.findall('weekendStart'):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	145 territories = elem.attrib['territories'].split()
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	146 if territory in territories or any([r in territories for r in regions]):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	147 week_data['weekend_start'] = weekdays[elem.attrib['day']]
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	148
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	149 for elem in supelem.findall('weekendEnd'):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	150 territories = elem.attrib['territories'].split()
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	151 if territory in territories or any([r in territories for r in regions]):
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	152 week_data['weekend_end'] = weekdays[elem.attrib['day']]
0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	153
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	154 time_zones = data.setdefault('time_zones', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	155 for elem in tree.findall('//timeZoneNames/zone'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	156 time_zones[elem.tag] = unicode(elem.findtext('displayName'))
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	157
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	158 for calendar in tree.findall('//calendars/calendar'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	159 if calendar.attrib['type'] != 'gregorian':
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	160 # TODO: support other calendar types
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	161 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	162
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	163 months = data.setdefault('months', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	164 for ctxt in calendar.findall('months/monthContext'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	165 ctxts = months.setdefault(ctxt.attrib['type'], {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	166 for width in ctxt.findall('monthWidth'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	167 widths = ctxts.setdefault(width.attrib['type'], {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	168 for elem in width.findall('month'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	169 if 'draft' in elem.attrib and int(elem.attrib['type']) in widths:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	170 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	171 widths[int(elem.attrib.get('type'))] = unicode(elem.text)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	172
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	173 days = data.setdefault('days', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	174 for ctxt in calendar.findall('days/dayContext'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	175 ctxts = days.setdefault(ctxt.attrib['type'], {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	176 for width in ctxt.findall('dayWidth'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	177 widths = ctxts.setdefault(width.attrib['type'], {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	178 for elem in width.findall('day'):
10 0ca5dd65594f Pull in some supplemental data from the CLDR, for things like the first day of the week. cmlenz parents: 3 diff changeset	179 dtype = weekdays[elem.attrib['type']]
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	180 if 'draft' in elem.attrib and dtype in widths:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	181 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	182 widths[dtype] = unicode(elem.text)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	183
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	184 quarters = data.setdefault('quarters', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	185 for ctxt in calendar.findall('quarters/quarterContext'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	186 ctxts = quarters.setdefault(ctxt.attrib['type'], {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	187 for width in ctxt.findall('quarterWidth'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	188 widths = ctxts.setdefault(width.attrib['type'], {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	189 for elem in width.findall('quarter'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	190 if 'draft' in elem.attrib and int(elem.attrib['type']) in widths:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	191 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	192 widths[int(elem.attrib.get('type'))] = unicode(elem.text)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	193
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	194 eras = data.setdefault('eras', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	195 for width in calendar.findall('eras/*'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	196 ewidth = {'eraNames': 'wide', 'eraAbbr': 'abbreviated'}[width.tag]
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	197 widths = eras.setdefault(ewidth, {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	198 for elem in width.findall('era'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	199 if 'draft' in elem.attrib and int(elem.attrib['type']) in widths:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	200 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	201 widths[int(elem.attrib.get('type'))] = unicode(elem.text)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	202
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	203 # AM/PM
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	204 periods = data.setdefault('periods', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	205 for elem in calendar.findall('am'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	206 if 'draft' in elem.attrib and elem.tag in periods:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	207 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	208 periods[elem.tag] = unicode(elem.text)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	209 for elem in calendar.findall('pm'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	210 if 'draft' in elem.attrib and elem.tag in periods:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	211 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	212 periods[elem.tag] = unicode(elem.text)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	213
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	214 date_formats = data.setdefault('date_formats', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	215 for elem in calendar.findall('dateFormats/dateFormatLength'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	216 if 'draft' in elem.attrib and elem.attrib.get('type') in date_formats:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	217 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	218 try:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	219 date_formats[elem.attrib.get('type')] = \
11 11f64b232b04 Add basic support for number format patterns. jonas parents: 10 diff changeset	220 dates.parse_pattern(unicode(elem.findtext('dateFormat/pattern')))
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	221 except ValueError, e:
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	222 print>>sys.stderr, 'ERROR: %s' % e
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	223
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	224 time_formats = data.setdefault('time_formats', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	225 for elem in calendar.findall('timeFormats/timeFormatLength'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	226 if 'draft' in elem.attrib and elem.attrib.get('type') in time_formats:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	227 continue
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	228 try:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	229 time_formats[elem.attrib.get('type')] = \
11 11f64b232b04 Add basic support for number format patterns. jonas parents: 10 diff changeset	230 dates.parse_pattern(unicode(elem.findtext('timeFormat/pattern')))
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	231 except ValueError, e:
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	232 print>>sys.stderr, 'ERROR: %s' % e
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	233
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	234 # <numbers>
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	235
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	236 number_symbols = data.setdefault('number_symbols', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	237 for elem in tree.findall('//numbers/symbols/*'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	238 number_symbols[elem.tag] = unicode(elem.text)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	239
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	240 decimal_formats = data.setdefault('decimal_formats', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	241 for elem in tree.findall('//decimalFormats/decimalFormatLength'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	242 if 'draft' in elem.attrib and elem.attrib.get('type') in decimal_formats:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	243 continue
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	244 pattern = unicode(elem.findtext('decimalFormat/pattern'))
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	245 decimal_formats[elem.attrib.get('type')] = numbers.parse_pattern(pattern)
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	246
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	247 scientific_formats = data.setdefault('scientific_formats', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	248 for elem in tree.findall('//scientificFormats/scientificFormatLength'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	249 if 'draft' in elem.attrib and elem.attrib.get('type') in scientific_formats:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	250 continue
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	251 # FIXME: should use numbers.parse_pattern
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	252 scientific_formats[elem.attrib.get('type')] = unicode(elem.findtext('scientificFormat/pattern'))
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	253
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	254 currency_formats = data.setdefault('currency_formats', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	255 for elem in tree.findall('//currencyFormats/currencyFormatLength'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	256 if 'draft' in elem.attrib and elem.attrib.get('type') in currency_formats:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	257 continue
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	258 # FIXME: should use numbers.parse_pattern
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	259 currency_formats[elem.attrib.get('type')] = unicode(elem.findtext('currencyFormat/pattern'))
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	260
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	261 percent_formats = data.setdefault('percent_formats', {})
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	262 for elem in tree.findall('//percentFormats/percentFormatLength'):
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	263 if 'draft' in elem.attrib and elem.attrib.get('type') in percent_formats:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	264 continue
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	265 pattern = unicode(elem.findtext('percentFormat/pattern'))
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	266 percent_formats[elem.attrib.get('type')] = numbers.parse_pattern(pattern)
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	267
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	268 currency_names = data.setdefault('currency_names', {})
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	269 currency_symbols = data.setdefault('currency_symbols', {})
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	270 for elem in tree.findall('//currencies/currency'):
28 695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	271 name = elem.findtext('displayName')
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	272 if name:
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	273 currency_names[elem.attrib['type']] = unicode(name)
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	274 symbol = elem.findtext('symbol')
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	275 if symbol:
695884591af6 * Reduce size of locale data pickles by only storing the data provided by each locale itself, and merging inherited data at runtime. cmlenz parents: 24 diff changeset	276 currency_symbols[elem.attrib['type']] = unicode(symbol)
3 e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	277
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	278 dicts[stem] = data
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	279 outfile = open(os.path.join(destdir, stem + '.dat'), 'wb')
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	280 try:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	281 pickle.dump(data, outfile, 2)
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	282 finally:
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	283 outfile.close()
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	284
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	285 if __name__ == '__main__':
e9eaddab598e Import of initial code base. cmlenz parents: diff changeset	286 main()

Mercurial > babel > old > mirror

annotate scripts/import_cldr.py @ 29:cbda87af9aa0