Thai News Dataset from Thai government website.
-
Updated
Jun 28, 2024 - Jupyter Notebook
Thai News Dataset from Thai government website.
open source corpora created, annotated or maintained by the ACoLi group at University of Augsburg, Germany.
A corpus and models for the automated legal assessment of clauses in German consumer contracts.
My Implementations' Archive
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
A very simple news crawler with a funny name
Python & command-line tool to gather text on the Web: Crawling & scraping, content extraction, metadata. TXT, Markdown, CSV & XML output.
FluCoMa's Learn Platform
粵文語料篩選器 Cantonese text filter
MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare
BlackLab Frontend, a feature-rich corpus search interface for BlackLab.
ELLE - Estonian language learning and analysis environment for learners, educators and linguists
Extracting character conversations in Genshin Project
Linguistic search for large annotated text corpora, based on Apache Lucene
L2SCA & LCA fork: cross-platform, GUI, without Java dependency
A Natural Language Processing model to perform Sentiment Analysis of US Airline Customers
Add a description, image, and links to the corpus topic page so that developers can more easily learn about it.
To associate your repository with the corpus topic, visit your repo's landing page and select "manage topics."