text/symbols.py: updating symbols

clean_slate
rafaelvalle 2018-11-25 22:34:26 -08:00
parent cdfde985e5
commit 1ea6ed5861
1 changed files with 4 additions and 3 deletions

View File

@ -7,11 +7,12 @@ The default is a set of ASCII characters that works well for English or text tha
from text import cmudict from text import cmudict
_pad = '_' _pad = '_'
_eos = '~' _punctuation = '!\'(),.:;? '
_characters = 'ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz!\'(),-.:;? ' _special = '-'
_letters = 'ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz'
# Prepend "@" to ARPAbet symbols to ensure uniqueness (some are the same as uppercase letters): # Prepend "@" to ARPAbet symbols to ensure uniqueness (some are the same as uppercase letters):
_arpabet = ['@' + s for s in cmudict.valid_symbols] _arpabet = ['@' + s for s in cmudict.valid_symbols]
# Export all symbols: # Export all symbols:
symbols = [_pad, _eos] + list(_characters) + _arpabet symbols = [_pad] + list(_special) + list(_punctuation) + list(_letters) + _arpabet