diff --git a/README.md b/README.md
index 19a5f7b46a848ed3d4f6ebc78779db6f236d29bc..a2be77c4497eff43213ec7fa9ea8e6b57a5e1624 100644
--- a/README.md
+++ b/README.md
@@ -1,3 +1,22 @@
 # disCut
 
-Discourse segmenter for DISRPT 2021
\ No newline at end of file
+Discourse segmenter for DISRPT 2021
+
+Data for DISRPT 2021: https://github.com/disrpt/sharedtask2021
+Website DISRPT 2021: https://sites.google.com/georgetown.edu/disrpt2021
+Code for DISRTP 2019: https://gitlab.inria.fr/andiamo/tony
+
+## Meeting 21.05.2021
+
+TODO:
+- install allennlp 0.9 + tony19
+- train a model (on english for instance)
+- test it with tony script
+- begin reading the tutorial on allennlp
+- continue with general reading NLP
+
+Next steps:
+- change to allennlp 1.xx
+- switch to xlm multi lingual
+- grouping the corpora during training
+- the sentence problem ? how do we address it