Commandline¶
usage: cliparser.py [-h] {tokenize,detokenize,sentence_split,normalize,morph,syllabify,wc,indic2roman,roman2indic,script_unify,script_convert} ...
Positional Arguments¶
subcommand | Operation to invoke; each operation is run through one of the subcommands. Possible choices: tokenize, detokenize, sentence_split, normalize, morph, syllabify, wc, indic2roman, roman2indic, script_unify, script_convert |
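The usage line above has the shape that Python's `argparse` prints for a parser with sub-commands. The following is a minimal sketch of such a parser, not the actual source of `cliparser.py`: the function name `build_parser` and the `dest="subcommand"` setting are assumptions made for illustration, and only the subcommand names are taken from this page.

```python
import argparse

# Subcommand names copied from the usage line on this page.
SUBCOMMANDS = [
    "tokenize", "detokenize", "sentence_split", "normalize", "morph",
    "syllabify", "wc", "indic2roman", "roman2indic",
    "script_unify", "script_convert",
]


def build_parser():
    """Hypothetical reconstruction; the real cliparser.py may be organised differently."""
    parser = argparse.ArgumentParser(prog="cliparser.py")
    # One sub-parser per operation; the chosen name is stored in args.subcommand.
    subparsers = parser.add_subparsers(dest="subcommand", required=True)
    for name in SUBCOMMANDS:
        subparsers.add_parser(name)
    return parser


args = build_parser().parse_args(["tokenize"])
print(args.subcommand)
```

Passing a name outside the list makes `argparse` exit with an "invalid choice" error, which matches the "Possible choices" wording in the table.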
Sub-commands¶
tokenize¶
tokenizer help
cliparser.py tokenize [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
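Most subcommands on this page share the same argument layout: two optional positionals that fall back to the standard streams, plus `-l/--lang`. As a hedged illustration of how that layout can be declared with `argparse`, consider the sketch below. The helper names `open_reader`, `open_writer` and `build_tokenize_parser` are hypothetical, and the real implementation may open files differently.

```python
import argparse
import sys


def open_reader(path):
    # Hypothetical helper: open a named input file as UTF-8 text.
    return open(path, "r", encoding="utf-8")


def open_writer(path):
    # Hypothetical helper: open a named output file as UTF-8 text.
    return open(path, "w", encoding="utf-8")


def build_tokenize_parser():
    parser = argparse.ArgumentParser(prog="cliparser.py tokenize")
    # nargs="?" makes each positional optional; when it is omitted the
    # default stream object is used as-is (argparse converts only strings).
    parser.add_argument("infile", nargs="?", type=open_reader,
                        default=sys.stdin, help="Input File path")
    parser.add_argument("outfile", nargs="?", type=open_writer,
                        default=sys.stdout, help="Output File path")
    parser.add_argument("-l", "--lang", help="Language")
    return parser


# "hi" is used as a sample language code because it appears on this page.
args = build_tokenize_parser().parse_args(["-l", "hi"])
print(args.lang)
```

Because the defaults are stream objects rather than paths, a subcommand declared this way can sit in a shell pipeline with no file arguments at all.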
detokenize¶
de-tokenizer help
cliparser.py detokenize [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
sentence_split¶
sentence split help
cliparser.py sentence_split [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
normalize¶
normalizer help
cliparser.py normalize [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
morph¶
morph help
cliparser.py morph [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
syllabify¶
syllabify help
cliparser.py syllabify [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
wc¶
wc help
cliparser.py wc [-h] [infile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
indic2roman¶
indic2roman help
cliparser.py indic2roman [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
roman2indic¶
roman2indic help
cliparser.py roman2indic [-h] [-l LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
script_unify¶
script_unify help
cliparser.py script_unify [-h] [-l LANG] [-m {naive,basic,aggressive}] [-c COMMON_LANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-l, --lang | Language |
-m, --mode | Script unification mode. Possible choices: naive, basic, aggressive. Default: “basic” |
-c, --common_lang | Common language in which all languages are represented. Default: “hi” |
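The named arguments of `script_unify` combine a restricted choice with two defaults. The sketch below shows one way these options could be declared with `argparse` and what values result when flags are omitted. The function name `build_script_unify_parser` is hypothetical, the positional file arguments are left out for brevity, and the language code `ta` is only a sample value, not a statement about which languages the tool supports.

```python
import argparse


def build_script_unify_parser():
    """Hypothetical reconstruction of the script_unify named arguments."""
    parser = argparse.ArgumentParser(prog="cliparser.py script_unify")
    parser.add_argument("-l", "--lang", help="Language")
    # Only the three listed modes are accepted; "basic" applies when -m is absent.
    parser.add_argument("-m", "--mode",
                        choices=["naive", "basic", "aggressive"],
                        default="basic", help="Script unification mode")
    parser.add_argument("-c", "--common_lang", default="hi",
                        help="Common language in which all languages are represented")
    return parser


defaults = build_script_unify_parser().parse_args([])
custom = build_script_unify_parser().parse_args(["-l", "ta", "-m", "aggressive"])
print(defaults.mode, defaults.common_lang)
print(custom.lang, custom.mode, custom.common_lang)
```

With no flags the parsed values are the documented defaults (`basic` and `hi`); `-c` stays at `hi` unless it is given explicitly.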
script_convert¶
script convert help
cliparser.py script_convert [-h] [-s SRCLANG] [-t TGTLANG] [infile] [outfile]
Positional Arguments¶
infile | Input file path. Default: standard input (stdin), UTF-8 |
outfile | Output file path. Default: standard output (stdout), UTF-8 |
Named Arguments¶
-s, --srclang | Source Language |
-t, --tgtlang | Target Language |
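When calling the tool from another Python program, the `script_convert` usage line can be turned into an argument list for `subprocess.run`. The helper below is a sketch under stated assumptions: `script_convert_command` is a hypothetical name, the interpreter name `python` and the script location `cliparser.py` depend on the installation, and the language codes and file names are sample values.

```python
import shlex


def script_convert_command(srclang, tgtlang, infile=None, outfile=None,
                           script="cliparser.py"):
    """Build an argv list following the script_convert usage line.

    The script path and the "python" interpreter name are assumptions.
    """
    cmd = ["python", script, "script_convert", "-s", srclang, "-t", tgtlang]
    # The file arguments are positional, so outfile can only follow infile.
    if infile is not None:
        cmd.append(infile)
        if outfile is not None:
            cmd.append(outfile)
    return cmd


cmd = script_convert_command("hi", "ta", "input.txt", "output.txt")
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`; leaving out both file names yields a command that reads standard input and writes standard output.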