Why and how my work evolved
Is the ℋ0 accepted?
There are no differences in the phonological acquisition of:
More than 14 hours of recorded conversations over 4 sessions
(9 three-session and 9 four-session female speakers + 3 four-session male speakers)
Text in French recorded by the four-session speakers
List of English words recorded by the four-session speakers
“Homemade” corpus of native spontaneous speech (NSS, ~4.5K tokens)
A multitier Praat TextGrid with phonemic and syllabic intervals aligned by SPPAS and P2FA for each recording
Datasheets with information for all corpora (1 row = 1 vowel)
(Add to this, the longitudinal aspect of the study)
Theory-driven: BEFMOA, BEFPOA, CLXFREQ…
Data-driven: PHONBEF, PHONAFT, PHONAFT…
Opportunistic and skill-driven: xSYLL, EPENTHETIC, Fxxx…
The counts of phonemes are unevenly distributed
(n = 17,189 for the 12 four-session speakers)
A look at the mean Euclidean distances between female native and learner values in the F1/F2 vocalic space
For mid-temporal values, the differences between raw and BDM-normalized values are minimal
When taking Peterson & Barney as reference point the distances are greater than with the NSS values
When taking Peterson & Barney as reference point /ɪ/ and /iː/ seem closer to native values
Reminder: Discrete Cosine Transform
(x-axis: percentage of the vowel’s duration; y-axis: Bark)
Discrete Cosine Transforms: K0 Inter-phoneme differences
Discrete Cosine Transforms: K1 Inter-phoneme differences
Discrete Cosine Transforms: K1 Inter-speaker differences
Regardless of how the data is processed, /ʊ/ and /uː/ feature more instability, and their formant values are further from native values than those of /ɪ/ and /iː/. The ℋ0 can be rejected.
Example of a KNN-based confusion matrix:
Expanding the NSS (using subtitles?)
Carrying out P2FA-based analyses
Adding syntactic tiers to the TextGrids
Analysing the influence of syllabic templates