Paraphrase Mining
Data
- German Wikipedia
- https://dumps.wikimedia.org/dewiki/
- currently:
dewiki-20220101-pages-articles.xml.bz2
- Quora for duplicate questions
Paper and Links
- https://arxiv.org/abs/2010.08240
- https://towardsdatascience.com/advance-nlp-model-via-transferring-knowledge-from-cross-encoders-to-bi-encoders-3e0fc564f554
Last modified July 16, 2022: fix headlines in ML doc (a533e7c)