VANA/VANA-python/subtitles_processing
Giò Diani f7c0df98b2 improvement topic modelling 2025-01-16 22:18:12 +01:00
src/subtitles_processing improvement topic modelling 2025-01-16 22:18:12 +01:00
.gitattributes improvement topic modelling 2025-01-16 22:18:12 +01:00
.gitignore improvement topic modelling 2025-01-16 22:18:12 +01:00
README.md improvement topic modelling 2025-01-16 22:18:12 +01:00
pixi.lock improvement topic modelling 2025-01-16 22:18:12 +01:00
pyproject.toml improvement topic modelling 2025-01-16 22:18:12 +01:00

README.md

subtitles_processing

Package for preparing the subtitles.

subtitles-processing.py

Normalizes the subtitles of an episode. The timecodes are re-stored so that every line contains a complete sentence.

python src/normalize_subtitles/subtitles-processing.py -a <"normalize"> -ep <int>
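The normalization idea can be illustrated with a minimal sketch. This is not the code in subtitles-processing.py: the `Cue` class, the `merge_cues_into_sentences` function, and the rule that a sentence ends at `.`, `!` or `?` are all assumptions made for illustration. It also assumes that a sentence never ends in the middle of a cue.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical cue representation: start/end in seconds plus the subtitle text.
@dataclass
class Cue:
    start: float
    end: float
    text: str

# Assumption: a sentence ends when the cue text ends with one of these characters.
SENTENCE_ENDINGS = (".", "!", "?")

def merge_cues_into_sentences(cues: List[Cue]) -> List[Cue]:
    """Merge consecutive cues until the accumulated text ends a sentence.

    The merged cue starts at the first cue's start time and ends at the
    last cue's end time.
    """
    merged: List[Cue] = []
    buffer: List[Cue] = []
    for cue in cues:
        buffer.append(cue)
        if cue.text.rstrip().endswith(SENTENCE_ENDINGS):
            text = " ".join(c.text.strip() for c in buffer)
            merged.append(Cue(buffer[0].start, buffer[-1].end, text))
            buffer = []
    # Keep a trailing fragment that never reached a sentence ending.
    if buffer:
        text = " ".join(c.text.strip() for c in buffer)
        merged.append(Cue(buffer[0].start, buffer[-1].end, text))
    return merged

example_cues = [
    Cue(0.0, 1.5, "Das ist ein"),
    Cue(1.5, 3.0, "ganzer Satz."),
    Cue(3.0, 4.0, "Noch einer!"),
]
sentences = merge_cues_into_sentences(example_cues)
```

With the example input, the first two cues collapse into one line spanning 0.0 to 3.0 seconds, and the third cue stays as it is. Abbreviations and ellipses would need extra handling in a real implementation.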

count_words.py

Counts the number of words per sentence.

python src/normalize_subtitles/count_words.py -ep <int>
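A minimal sketch of counting words per sentence, assuming words are separated by whitespace. The function name `count_words_per_sentence` is hypothetical and the tokenization used by count_words.py may differ.

```python
from typing import List

def count_words_per_sentence(sentences: List[str]) -> List[int]:
    """Return the number of whitespace-separated tokens in each sentence."""
    # str.split() without arguments splits on any run of whitespace,
    # so punctuation attached to a word does not count as its own token.
    return [len(sentence.split()) for sentence in sentences]

word_counts = count_words_per_sentence([
    "Das ist ein ganzer Satz.",
    "Noch einer!",
])
```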

sentence_sentiment.py

Computes the sentiment of each sentence.

python src/normalize_subtitles/sentence_sentiment.py -ep <int>
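The README does not say which sentiment method the script uses. The following sketch therefore swaps in a simple lexicon lookup purely as an illustration: `TOY_LEXICON`, its scores, and the function `sentence_sentiment` are made up for this example and are not taken from the package.

```python
import re
from typing import Dict

# Hypothetical toy lexicon: word -> polarity score in [-1, 1].
TOY_LEXICON: Dict[str, float] = {
    "gut": 1.0,
    "schön": 1.0,
    "schlecht": -1.0,
    "traurig": -1.0,
}

def sentence_sentiment(sentence: str, lexicon: Dict[str, float] = TOY_LEXICON) -> float:
    """Average the polarity of all lexicon words found in the sentence.

    Returns 0.0 (neutral) when no word of the sentence is in the lexicon.
    """
    words = re.findall(r"\w+", sentence.lower())
    scores = [lexicon[word] for word in words if word in lexicon]
    return sum(scores) / len(scores) if scores else 0.0

positive = sentence_sentiment("Das Wetter ist gut und schön.")
mixed = sentence_sentiment("Der Film war gut, aber das Ende traurig.")
neutral = sentence_sentiment("Hallo Welt.")
```

A lexicon average ignores negation and context, so a trained model would likely be a better fit for real subtitles; the sketch only shows the per-sentence shape of the computation.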