May.la

Tag: german-data

Anomalies in the MLSUM Dataset

Tags:
  • german-data
  • mlsum
  • mt5
  • summarization
  • t5
Categories:
  • NLP

Clean German Wikipedia Text Corpus released

Tags:
  • german-data
  • somajo
  • spacy
  • wikipedia
  • text-corpus
Categories:
  • Data
  • NLP

German colossal, cleaned Common Crawl corpus (GC4) released

Tags:
  • german-data
  • text-corpus
  • common-crawl
Categories:
  • Data
  • NLP