disCut
Discourse segmenter for DISRPT 2021
- Data for DISRPT 2021: https://github.com/disrpt/sharedtask2021
- Website for DISRPT 2021: https://sites.google.com/georgetown.edu/disrpt2021
- Code for DISRPT 2019: https://gitlab.inria.fr/andiamo/tony
Meeting 21.05.2021
TODO:
- install allennlp 0.9 + tony19
- train a model (on English, for instance)
- test it with the tony script
- begin reading the allennlp tutorial
- continue general NLP reading
Next steps:
- change to allennlp 1.xx
- switch to multilingual XLM
- group the corpora during training
- the sentence problem: how do we address it?
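
One possible way to approach the corpus-grouping step is to concatenate the training files of several corpora into a single file before training. The sketch below is only an illustration under assumptions: it assumes CoNLL-style training files in which blocks (sentences or documents) are separated by blank lines, and the names `read_blocks`, `group_corpora`, and the file paths are hypothetical, not part of tony or allennlp.

```python
from pathlib import Path
from typing import Iterable, List


def read_blocks(path: Path) -> List[str]:
    """Return the blank-line-separated blocks of a CoNLL-style file (assumed format)."""
    text = path.read_text(encoding="utf-8")
    blocks = [block.strip("\n") for block in text.split("\n\n")]
    # Drop empty blocks caused by trailing or repeated blank lines.
    return [block for block in blocks if block.strip()]


def group_corpora(train_files: Iterable[Path], out_file: Path) -> int:
    """Concatenate several training files into one; return the number of blocks written."""
    all_blocks: List[str] = []
    for path in train_files:
        all_blocks.extend(read_blocks(Path(path)))
    # Every block, including the last one, ends with a blank line.
    out_file.write_text("\n\n".join(all_blocks) + "\n\n", encoding="utf-8")
    return len(all_blocks)
```

The merged file could then be given to the trainer in place of a single-corpus training file. Whether to shuffle or balance the corpora when grouping them is left open here.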