Commandline

usage: cliparser.py [-h]
                    {tokenize,detokenize,sentence_split,normalize,morph,syllabify,wc,indic2roman,roman2indic,script_unify,script_convert}
                    ...

Positional Arguments

subcommand

Possible choices: tokenize, detokenize, sentence_split, normalize, morph, syllabify, wc, indic2roman, roman2indic, script_unify, script_convert

Invoke each operation with one of the subcommands
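The usage line above has the shape that Python's `argparse` produces when a parser defines sub-parsers. As a minimal sketch, assuming `cliparser.py` is built on `argparse` (the function name `build_parser` and the `dest` name are illustrative, not taken from the library), the subcommand dispatch could look like this:

```python
import argparse

# Subcommand names copied from the usage line above.
SUBCOMMANDS = [
    "tokenize", "detokenize", "sentence_split", "normalize", "morph",
    "syllabify", "wc", "indic2roman", "roman2indic", "script_unify",
    "script_convert",
]


def build_parser():
    """Hypothetical parser with one sub-parser per documented subcommand."""
    parser = argparse.ArgumentParser(prog="cliparser.py")
    # required=True makes argparse reject a call that names no subcommand.
    subparsers = parser.add_subparsers(dest="subcommand", required=True)
    for name in SUBCOMMANDS:
        subparsers.add_parser(name)
    return parser


args = build_parser().parse_args(["tokenize"])
print(args.subcommand)
```

A name outside the list of possible choices makes `argparse` print an error and exit, which is why exactly one subcommand must be given per invocation.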

Sub-commands

tokenize

tokenizer help

cliparser.py tokenize [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language
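The optional `[infile] [outfile]` positionals mean the subcommand can sit in a shell pipeline: when a path is omitted, the corresponding standard stream is used. The sketch below reproduces that behaviour with `argparse` alone. It is an assumption about how such a parser could be declared, not the library's actual source, and `build_tokenize_parser` is a hypothetical name.

```python
import argparse
import sys


def build_tokenize_parser():
    """Hypothetical stand-in for the tokenize sub-parser."""
    parser = argparse.ArgumentParser(prog="cliparser.py tokenize")
    # nargs="?" makes each positional optional; the default is used
    # as-is when the argument is missing from the command line.
    parser.add_argument(
        "infile", nargs="?", default=sys.stdin,
        type=argparse.FileType("r", encoding="utf-8"),
        help="Input File path",
    )
    parser.add_argument(
        "outfile", nargs="?", default=sys.stdout,
        type=argparse.FileType("w", encoding="utf-8"),
        help="Output File path",
    )
    parser.add_argument("-l", "--lang", help="Language")
    return parser


# No paths given: both streams fall back to stdin and stdout.
args = build_tokenize_parser().parse_args(["-l", "hi"])
print(args.lang, args.infile is sys.stdin, args.outfile is sys.stdout)
```

The language code `hi` is used here only because it appears elsewhere on this page as a default value.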

detokenize

de-tokenizer help

cliparser.py detokenize [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

sentence_split

sentence split help

cliparser.py sentence_split [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

normalize

normalizer help

cliparser.py normalize [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

morph

morph help

cliparser.py morph [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

syllabify

syllabify help

cliparser.py syllabify [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

wc

wc help

cliparser.py wc [-h] [infile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

indic2roman

indic2roman help

cliparser.py indic2roman [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

roman2indic

roman2indic help

cliparser.py roman2indic [-h] [-l LANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

script_unify

script_unify help

cliparser.py script_unify [-h] [-l LANG] [-m {naive,basic,aggressive}]
                          [-c COMMON_LANG]
                          [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-l, --lang Language

-m, --mode

Possible choices: naive, basic, aggressive

Script unification mode

Default: "basic"

-c, --common_lang

Common language in which all languages are represented

Default: "hi"
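The named arguments of `script_unify` combine a restricted set of choices with defaults. A small sketch of how such options behave under `argparse` follows. The parser below is a hypothetical reconstruction from the usage text above (`build_script_unify_parser` is an illustrative name) and omits the positional file arguments.

```python
import argparse


def build_script_unify_parser():
    """Hypothetical stand-in for the script_unify named arguments."""
    parser = argparse.ArgumentParser(prog="cliparser.py script_unify")
    parser.add_argument("-l", "--lang", help="Language")
    # choices= restricts the accepted values; default= applies when
    # the flag is not given at all.
    parser.add_argument(
        "-m", "--mode", choices=["naive", "basic", "aggressive"],
        default="basic", help="Script unification mode",
    )
    parser.add_argument(
        "-c", "--common_lang", default="hi",
        help="Common language in which all languages are represented",
    )
    return parser


defaults = build_script_unify_parser().parse_args([])
print(defaults.mode, defaults.common_lang)  # prints: basic hi

explicit = build_script_unify_parser().parse_args(["-m", "aggressive"])
print(explicit.mode)  # prints: aggressive
```

A value for `-m` outside the three listed choices is rejected by `argparse` before any processing starts.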

script_convert

script convert help

cliparser.py script_convert [-h] [-s SRCLANG] [-t TGTLANG] [infile] [outfile]

Positional Arguments

infile

Input File path

Default: standard input (stdin)

outfile

Output File path

Default: standard output (stdout)

Named Arguments

-s, --srclang Source Language
-t, --tgtlang Target Language
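When a subcommand is driven from another Python program, it can help to assemble the argument list in one place. The helper below is a hypothetical convenience and not part of the library. It uses only the flags shown in the `script_convert` usage line, and the script location, the language code `ta`, and the file names are placeholders chosen for illustration.

```python
import sys


def build_script_convert_command(srclang, tgtlang, infile=None,
                                 outfile=None, script="cliparser.py"):
    """Return an argv list for script_convert (hypothetical helper).

    `script` is an assumed path to cliparser.py; adjust it to wherever
    the file lives in your installation.
    """
    command = [sys.executable, script, "script_convert",
               "-s", srclang, "-t", tgtlang]
    # Both positionals are optional, but outfile comes second, so it
    # can only be passed when infile is passed as well.
    if infile is not None:
        command.append(infile)
        if outfile is not None:
            command.append(outfile)
    elif outfile is not None:
        raise ValueError("outfile can only be given together with infile")
    return command


# Skip the interpreter path when printing, since it varies by machine.
print(build_script_convert_command("hi", "ta")[1:])
print(build_script_convert_command("hi", "ta", "input.txt", "output.txt")[1:])
```

The resulting list can be handed to `subprocess.run`. With no file paths, the command reads from stdin and writes to stdout, as described by the defaults above.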