Also posted to NFDI4Culture: https://tickets.nfdi4culture.de/work_packages/9750/activity
What DH and repository tooling (software) is out there for working on a corpus: corpus management, packaging, semantification, carrying out data analysis, and producing research outputs?
The reason for asking is this: for an individual publication, how do we make it usable and compatible with the standards used in existing systems for corpus packaging and data analysis?
The kinds of tasks, functions, capabilities being looked at are:
- Collect corpus into one file system
- Package corpus with an inventory
- Convert the corpus to an open, interoperable standard format, with validation against that format
- Corpus versioning and forking
- Semantification: annotate with Named Entity Recognition (NER)
- Semantic concept annotation
- Enable NLP analysis: word frequency
- Enable syntactic and syntactic/semantic markup
- Enable text and data mining (TDM)
- Research outputs: allow the analysis of findings and results to be output as data, with a copy of the corpus if needed, so that outputs are compatible with Open Science
- Reporting on the corpus: bibliometrics, presenting knowledge and ideas, statistics to back findings, etc.
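
To make two of the items above concrete (packaging a corpus with an inventory, and word frequency), here is a minimal sketch using only the Python standard library. It is an illustration of the kind of function being asked about, not a recommendation of a specific tool: the function names `build_inventory` and `word_frequencies`, the choice of SHA-256 checksums, and the assumption that the corpus is a folder of UTF-8 `.txt` files are all mine.

```python
import hashlib
import re
from collections import Counter
from pathlib import Path


def build_inventory(corpus_dir):
    """List every file under corpus_dir with its size and SHA-256 checksum.

    Hypothetical helper: the record layout (path, bytes, sha256) is an
    assumption for illustration, not a published packaging standard.
    """
    root = Path(corpus_dir)
    records = []
    for path in sorted(root.rglob("*")):
        if path.is_file():
            data = path.read_bytes()
            records.append({
                # Relative POSIX-style path keeps the inventory portable.
                "path": path.relative_to(root).as_posix(),
                "bytes": len(data),
                "sha256": hashlib.sha256(data).hexdigest(),
            })
    return records


def word_frequencies(corpus_dir, pattern="*.txt"):
    """Count lowercase word tokens across all files matching pattern.

    Assumes UTF-8 plain text; tokenisation is a naive regex split,
    not a linguistically informed tokenizer.
    """
    root = Path(corpus_dir)
    counts = Counter()
    for path in sorted(root.rglob(pattern)):
        text = path.read_text(encoding="utf-8").lower()
        counts.update(re.findall(r"\w+", text))
    return counts
```

Calling `build_inventory("my_corpus")` returns a list of records that could be saved next to the corpus as its inventory, and `word_frequencies("my_corpus").most_common(20)` gives the twenty most frequent tokens. The folder name `my_corpus` is a placeholder.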