July 02, 2026 07:54 pm (IST)
Follow us:
facebook-white sharing button
twitter-white sharing button
instagram-white sharing button
youtube-white sharing button
India-US trade deal almost done! Piyush Goyal hints at breakthrough | Ram Mandir donation scam: Champat Rai points finger at his own driver | PM Modi welcomes Japanese PM Sanae Takaichi as India-Japan ties enter a new era | 'Not an isolated incident': India slams Pakistan after 125-year-old historic Gurdwara is demolished | Ram Mandir donation theft: Six accused were employed by Varanasi-based security firm, probe reveals | Ayodhya Ram Temple donation theft: Probe says majority of money was allegedly stolen during Kumbh Mela | Commercial LPG price slashed by Rs 183.50 from July 1; check new rates in Delhi, Mumbai, Kolkata and Chennai | Trump suffers major blow as US Supreme Court upholds birthright citizenship | Delhi-Mumbai Expressway horror: Passenger bus goes up in flames after fatal collision, 8 dead | 'Dharmendra Pradhan will be responsible if anything happens': CJP warns as Sonam Wangchuk's health worsens on day 3 of hunger strike

Analysis of billions of Twitter words reveals how American English develops

| @indiablooms | Sep 26, 2018, at 05:17 pm

London, Sept 26 (IBNS): New words in American English tend to develop in five regional linguistic ‘hotspots’ before spreading across the United States and beyond, a new study reveals.

Linguists and geographers analysed 8.9 billion words contained within 980 million Tweets posted across the United States between 2013 and 2014 to identify the regions from which new words tend to originate.

Led by Professor Jack Grieve, from the Centre for Corpus Research at the University of Birmingham, researchers used advanced computer technology to analyse the geocoded Tweets which revealed the precise longitude and latitude of the user at the time of posting.

They tracked the origin of 54 newly emerging words in American English. For example, they found that the word ‘baeless’, which mean ‘to be single’, originated from Deep South, while the word ‘mutuals’, which is short for ‘mutual friends’, originated from the West Coast.

Geo-coded data from Twitter allowed them to create maps for these 54 words, showing how the phrases had spread across the country over time.

Applying modern computational techniques to the study of language variation and change, the team identified that development of new words in Modern American English centred on five regions: The West Coast, the Northeast, the Mid Atlantic, the Deep South, and the Gulf Coast.

Professor Grieve commented: “This is the first time that such a large sample of emerging words or any type of linguistic innovation has been mapped in one language. Twitter is only one variety of language, but given that almost all these words are used in everyday speech, we believe our results reflect the words’ general spread in American English.

“Our study provides a framework for future research by showing how the origin and spread of emerging words can be measured and mapped. Linguistics is shifting from a social science to a data science, where linguists are increasingly analysing massive amounts of natural language harvested online.

“This is allowing us to pursue new research questions that would have been impossible to investigate just a few years ago. We can analyse in very fine detail how language changes over short periods of time and understand the processes through which languages evolve - one of the most challenging questions in science.”

The researchers’ findings also challenge existing theories of the spread of new words. They show that new words do not simply spread out unconstrained from their source, nor do they spread from one large city to the next, as predicted by previously developed theories for the spread of new words, known as the ‘wave’ and ‘gravity’ models.

Instead, the study found the spread of new words is constrained by cultural patterns. New words tend to spread within cultural regions, before reaching the rest of the United States.  It also found that African American English was a major source of lexical innovation on US Twitter.

Professor Grieve is speaking about the team’s research at the New Ways of Analyzing Variation (NWAV) conference held at New York University from October 18 to 21. He will focus on how these words spread just in New York City over the time period in question, as well as  delivering a workshop on 'computational sociolinguistics'.

Support Our Journalism

We cannot do without you.. your contribution supports unbiased journalism

IBNS is not driven by any ism- not wokeism, not racism, not skewed secularism, not hyper right-wing or left liberal ideals, nor by any hardline religious beliefs or hyper nationalism. We want to serve you good old objective news, as they are. We do not judge or preach. We let people decide for themselves. We only try to present factual and well-sourced news.

Support objective journalism for a small contribution.