Nkululeko: how to explicitly model linguistics

22. July 2025 felix Leave a comment

With nkululeko since version 0.96 you there are linguistic feature extractors, i.e. using the text of the spoḱen words as input.

Of course you can combine them with acoustic features and use any fitting model architecture with it.

[EXP]
# optional: language for linguistics
language = de

[DATA]
data = ../mydata
# the linguistic feature extractors require a column named "text"
# example, perhaps not needed!
data.col_names = {"transcription":"text"}

[FEAT]
# combine linguistic bert features with acoustic open smile features
type = ['bert', 'os']

[MODEL]
type = xgb

Allgemein

Nkululelo: how to translate your textual transcriptions

14. July 2025 felix Leave a comment

With nkululeko since version 0.95.9 you can use google translate to translate your data automatically.

Simply set the language (default is en) in the PREDICT section and a prediction target translation like this:

[EXP]
# optional
language = de 
[PREDICT]
targets = ['translation']
# optional
target_language = en

and then run the module:

python -m nkululeko.predict --config my_conf.ini.

speechsurfer

Monthly Archives: July 2025

Nkululeko: how to explicitly model linguistics

Nkululelo: how to translate your textual transcriptions

blog around speech technology