# subtitles_processing
Package for preprocessing subtitles.

## subtitles-processing.py
Normalizes the subtitles of an episode. The timecodes are reassigned so that each line contains only complete sentences.

```bash
python src/normalize_subtitles/subtitles-processing.py -a <"normalize"> -ep <int>
```
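
The regrouping described above can be sketched as follows. This is a minimal illustration, not the code in `subtitles-processing.py`: the `Cue` class, the `merge_cues_into_sentences` function, and the sentence-end heuristic are all hypothetical. It assumes each cue carries a start timecode, an end timecode, and text, and it only merges whole cues; a cue that ends one sentence and starts the next is not split.

```python
import re
from dataclasses import dataclass


@dataclass
class Cue:
    """Hypothetical subtitle cue: start/end timecodes and the displayed text."""
    start: str
    end: str
    text: str


# Assumption: a sentence ends with ., ! or ?, optionally followed by closing quotes/brackets.
SENTENCE_END = re.compile(r"[.!?][\"')\]]*$")


def merge_cues_into_sentences(cues):
    """Merge consecutive cues until the accumulated text ends a sentence."""
    merged = []
    buffer = []        # text fragments of the sentence being built
    start = None       # start timecode of the first cue in the buffer
    last_end = None    # end timecode of the most recent cue in the buffer
    for cue in cues:
        text = cue.text.strip()
        if not text:
            continue   # skip empty cues
        if start is None:
            start = cue.start
        buffer.append(text)
        last_end = cue.end
        if SENTENCE_END.search(text):
            merged.append(Cue(start, last_end, " ".join(buffer)))
            buffer, start = [], None
    if buffer:
        # Trailing fragment without sentence-final punctuation is kept as-is.
        merged.append(Cue(start, last_end, " ".join(buffer)))
    return merged
```

Each merged cue takes the start timecode of its first fragment and the end timecode of its last, so the sentence spans the full time during which it was on screen.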

## count_words.py
Counts the number of words per sentence.
```bash
python src/normalize_subtitles/count_words.py -ep <int>
```
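
One possible way to count words per sentence is shown below. This is a sketch under assumptions, not the logic of `count_words.py`: the `count_words` helper is hypothetical, and it treats contractions and hyphenated compounds as single words.

```python
import re

# Assumption: a word is a run of word characters, optionally joined
# by apostrophes or hyphens (e.g. "don't", "well-known").
WORD = re.compile(r"\w+(?:['’-]\w+)*")


def count_words(sentence):
    """Return the number of words in a single sentence."""
    return len(WORD.findall(sentence))


def count_words_per_sentence(sentences):
    """Map each sentence to its word count, preserving order."""
    return [(s, count_words(s)) for s in sentences]
```

Punctuation is ignored, so `count_words("How are you?")` counts three words.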

## sentence_sentiment.py
Computes the sentiment of each sentence.
```bash
python src/normalize_subtitles/sentence_sentiment.py -ep <int>
```

## topics.py
Generates topics using LDA (Latent Dirichlet Allocation).
```bash
python src/normalize_subtitles/topics.py -ep <int>
```