Grammalecte  Check-in [ecdbcd1539]

Overview
Comment:[build] màj: build_data
Downloads: Tarball | ZIP archive | SQL archive
Timelines: family | ancestors | descendants | both | trunk | build
Files: files | file ages | folders
SHA3-256: ecdbcd1539a2283891c81aa22d023e1488d0d0232b0ee77a5e4320aebcaf644b
User & Date: olr on 2017-06-23 23:09:46
Other Links: manifest | tags
Context
2017-06-23
23:23
[fr] phonet_simil: simplification des lignes contenant des verbes du 1ᵉʳ groupe à l’infinitif check-in: a1812bdacc user: olr tags: fr, trunk
23:09
[build] màj: build_data check-in: ecdbcd1539 user: olr tags: build, trunk
22:46
[build] more options for build_data check-in: 891cbb4e9d user: olr tags: build, trunk
Changes

Modified gc_lang/fr/build_data.py from [79b36913b6] to [89b72162ae].

   266    266           return
   267    267   
   268    268       with open(sp+"/data/phonet_simil.txt", 'r', encoding='utf-8') as hSrc:
   269    269           # set of homophonic words
   270    270           lSet = []
   271    271           for sLine in hSrc.readlines():
   272    272               if not sLine.startswith("#") and sLine.strip():
   273         -                aWord = set(sLine.strip().split())
          273  +                lWord = sLine.strip().split()
   274    274                   aMore = set()
   275         -                for sWord in aWord:
          275  +                for sWord in lWord:
   276    276                       if sWord.endswith("er") and conj.isVerb(sWord):
   277    277                           aMore = aMore.union(conj.getConjSimilInfiV1(sWord))
   278         -                aWord = aWord.union(aMore)
   279         -                lSet.append(aWord)
          278  +                lWord.extend(list(aMore))
          279  +                lSet.append(lWord)
          280  +                #print(lWord)
   280    281           # dictionary of words
   281    282           dWord = {}
   282    283           for i, aSet in enumerate(lSet):
   283    284               for sWord in aSet:
   284    285                   if oDict.lookup(sWord):
   285    286                       dWord[sWord] = i  # warning, what if word in several sets?
   286    287                   else:

Modified gc_lang/fr/modules/phonet_data.py from [98fafd6c52] to [63ece6deca].

cannot compute difference between binary files