Skip to content
Snippets Groups Projects
Name Last commit Last update
code
data
.gitignore
README.md
requirements.txt

disCut

Discourse segmenter for DISRPT 2021

Useful Links:

Requirements:

  • python 3.7
  • requirements.txt: pip install -r requirements.txt
  • pytorch: pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio===0.9.0 -f https://download.pytorch.org/whl/torch_stable.html

Usage:

  • train: bash expes.sh eng.rst.rstdt conllu bert train
  • test: bash expes.sh eng.rst.rstdt conllu bert test
  • fine-tune with other model: bash expes.sh eng.rst.rstdt conllu bert train eng
  • test on other model: bash expes.sh eng.rst.rstdt conllu bert test eng
  • merge two datasets: bash merger.sh eng.rst.rstdt eng.rst.gum eng
  • split with stanza: python parse_corpus.py eng.rst.rstdt --parser stanza