Skip to content
GitLab
Projects
Groups
Snippets
Help
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Open sidebar
LINAGORA
L
LGS
Labs
kaldi-modelgen
Commits
5b6eff07
Commit
5b6eff07
authored
Jan 25, 2017
by
Abdelwahab HEBA
Browse files
fix synchro
parent
bbeb4078
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
4 deletions
+5
-4
run.sh
run.sh
+5
-4
No files found.
run.sh
View file @
5b6eff07
...
...
@@ -53,7 +53,7 @@ local/prepare_dict.sh --stage 3 --nj 4 --cmd "$train_cmd" \
###### OOOOOOK
utils/prepare_lang.sh data/local/dict
\
"<
unk
>"
data/local/lang_tmp data/lang
"<
UNK
>"
data/local/lang_tmp data/lang
export
LC_ALL
=
fr_FR.UTF-8
...
...
@@ -82,15 +82,16 @@ for part in dev test train; do
#steps/make_fbank.sh --cmd "$train_cmd" --nj 12 data/$part exp/make_fbank/$part $fbankdir
#steps/compute_cmvn_stats.sh data/$part exp/make_fbank/$part $fbankdir
done
# utils/fix_data_dir.sh data/train
utils/fix_data_dir.sh data/train
utils/fix_data_dir.sh data/test
utils/fix_data_dir.sh data/dev
# # Make some small data subsets for early system-build stages. Note, there are 29k
# # utterances in the train_clean_100 directory which has 100 hours of data.
# # For the monophone stages we select the shortest utterances, which should make it
# # easier to align the data from a flat start.
utils/subset_data_dir.sh
--shortest
data/train 15000 data/train_15kshort
utils/subset_data_dir.sh
--shortest
data/train 70000 data/train_70k
short
utils/subset_data_dir.sh data/train 70000 data/train_70k
utils/subset_data_dir.sh data/train 120000 data/train_120k
#utils/subset_data_dir.sh data/train 120000 data/train_120k
# # train a monophone system
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment