... McConnell-Ginet, 2003).Analyzing such differences is not only interestingfrom the sociolinguistic and psycholinguistic pointof view of language understanding, but also froman engineering perspective, ... speakers usedfor training and 100 speakers used for testing, re-sulting in a total of 4062 conversation sides fortraining and 808 conversation sides for testing.4 Modeling Gender via Ngram ... training data and the ngram-based modelwas retrained on the remaining subset.Figure 2: Empirical differences in sociolinguistic featuresfor Gender on the Switchboard corpus6 Incorporating...