Seminar Project for Multimodal Corpus Linguistics seminar in SS18
- python 3.5+
- keras 2
- tf 1.8
- weighted class weights, attention wrapper,
- train on different data type (scrripted/ improvised/ both)
- train on different speech features (static/ dynamics - deltas and deltasdeltas)
- train on different modalities (speech/ text/ motion/ all/ configure your own combinations in ./configs)
- train on different ANN architectures (convs/ rnns/ configure your own models in ./models)
- speech speaker independent configuration
python HearYou2.0.py -c configs/<model_to_run>.json
run all configurations stored in ./configs if -c flag is not given
| data type(all feat) | Scripted | Improvised | Both | |||
|---|---|---|---|---|---|---|
| MFCC | all | MFCC | all | MFCC | all | |
| text | xx% | xx% | 56% | xx% | 61% | xx% |
| speech | xx% | xx% | 34% | xx% | 51% | xx% |
| mocap | xx% | xx% | xx% | xx% | 45% | xx% |
| text+speech | xx% | xx% | xx% | xx% | 67% | xx% |
| text+speech+mocap | xx% | xx% | xx% | xx% | 70% | xx% |
| feature type | MFCC | 34 |
|---|---|---|
| speech | 57% | 51% |
| speech+mocap | 73% | 74% |
| text+speech | 65% | 50% |
| text+speech+mocap | 76% | 69% |
| feature type | MFCC | 34 |
|---|---|---|
| speech | 53% | 51% |
| speech+mocap | 41% | 47% |
| text+speech | 54% | 51% |
| text+speech+mocap | 44% | 38% |
| feature type | MFCC | 34 |
|---|---|---|
| speech | 50% | 50% |
| speech+mocap | 61% | 52% |
| text+speech | 50% | 52% |
| text+speech+mocap | 60% | 50% |
- (text lstm without attention)
IEMOCAP data
Feature Extraction Library
Deltas & DeltasDeltas
Conceptor
Multimodality