Skip to content
Snippets Groups Projects

Requirements

ersatz (pip install ersatz)

Usage

python tok2conllu.py <file>

File must be a .tok file.

Output

<file>_conllu: same file but sentences are separated by a line-jump and a commented line with sentence id.

Ersatz library

Command-line usage:

ersatz --input <input.txt> --output <output.txt>

Takes as input any text file and outputs the same text file with sentences separated by a line-break.