Posted February 14, 2012 Atlanta, GA
Georgia Tech study shows certain words, phrases predict if messages sent up/down the org chart
ATLANTA – Feb. 14, 2012 – Members of the modern workforce might be surprised to learn that if they use the word “weekend” in a workplace email, chances are they’re sending the message up the org chart. The same goes for the words “voicemail,” “driving,” “okay,” and even a choice four-letter word that rhymes with “hit.” A new study by Georgia Tech’s Eric Gilbert shows that certain words and phrases are indeed reliable indicators of whether workplace emails are sent to someone higher or lower in the corporate hierarchy.
Gilbert, assistant professor in the School of Interactive Computing, focused his attention on the “Enron corpus,” a body of 500,000 emails among about 150 former Enron employees, making it the largest email dataset available for public study. Even after taking a “conservative, careful” approach—applying numerous filters and eliminating thousands of emails that would have muddied his conclusions—he still was able to identify lists of words that reliably predicted whether emails traveled up or down the ladder.
“Across a wide variety of messages and relationships, these phrases consistently stand out as signaling a power relationship between two people,” Gilbert said. “The probability of it occurring due to chance alone is less than 1 in 1,000.”
Primarily, Gilbert’s work could be applied in designing “smarter” email software. Future email clients, he said, might be able to differentiate between emails from superiors and emails from subordinates, and then use that information to better honor a user’s email preferences. Messages sent after 5 p.m. by people under you, for example, might be held for delivery until the next day, while emails from the boss (or the boss’s assistant) could go right through.
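As a rough illustration of the delivery rule described above, the sketch below holds after-hours mail from subordinates until the next morning and lets everything else through. The `Direction` labels, the `delivery_time` function, and the 9 a.m. release hour are hypothetical choices made for this example; the study does not describe any such software.

```python
from datetime import datetime, time, timedelta
from enum import Enum


class Direction(Enum):
    """Hypothetical labels for where a message sits relative to the reader."""
    FROM_SUPERIOR = "from_superior"
    FROM_SUBORDINATE = "from_subordinate"
    UNKNOWN = "unknown"


END_OF_DAY = time(17, 0)   # the "post-5 p.m." threshold mentioned in the article
NEXT_MORNING = time(9, 0)  # assumed release hour, not taken from the article


def delivery_time(received: datetime, direction: Direction) -> datetime:
    """Return when a message should reach the inbox under this toy policy."""
    if direction is Direction.FROM_SUBORDINATE and received.time() >= END_OF_DAY:
        # Hold the message until the assumed release hour on the following day.
        next_day = received.date() + timedelta(days=1)
        return datetime.combine(next_day, NEXT_MORNING)
    # Mail from superiors, unknown senders, or during working hours is not delayed.
    return received
```

In a real client, the `direction` value would have to come from a classifier or an org chart; here it is simply passed in by the caller.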
“We have organizational charts, but they don’t tell the whole story,” said Gilbert, adding that the research could help map “informal power and reporting structures” in an organization. “A classic example is the CEO’s administrative assistant: That person may not occupy a high box on the org chart, but he or she still has a large amount of influence.”
Top 5 Upward Predictors
- the ability to
- I took
- are available
- thought you would
Top 5 Downward Predictors
- have you been
- you gave
- we are in
- need in
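To make the idea of phrase-based prediction concrete, here is a toy sketch that counts how many of the phrases listed above appear in a message. It is not the statistical model from the study; the `direction_hint` function and its simple counting rule are assumptions for illustration, and it uses only the phrases printed in the two lists.

```python
# Phrases copied from the two lists above, lowercased for matching.
UPWARD = ["the ability to", "i took", "are available", "thought you would"]
DOWNWARD = ["have you been", "you gave", "we are in", "need in"]


def direction_hint(text: str) -> str:
    """Guess 'up', 'down', or 'unclear' by counting listed phrases (toy rule)."""
    lowered = text.lower()
    up = sum(lowered.count(phrase) for phrase in UPWARD)
    down = sum(lowered.count(phrase) for phrase in DOWNWARD)
    if up > down:
        return "up"
    if down > up:
        return "down"
    return "unclear"
```

A plain substring count like this ignores context and phrase weighting, so it should be read only as a picture of what "predictive phrases" means, not as a usable classifier.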
The project falls in the research field called “applied natural language processing,” which has been an important subfield of artificial intelligence for the past 25 years. Tools first developed in this area can also be applied to other data, yielding enlightening results about human interaction through electronic media or simply about how people relate to each other.
One complicating factor in Gilbert’s research was the dataset itself. Aside from its utility as grist for study, the Enron corpus can also be considered profoundly flawed as an example of “natural” language; after all, Enron and its officers were responsible for one of the largest, most systemic examples of corporate fraud in U.S. history, and Gilbert acknowledges that this behavior likely bled into those officers’ communications with each other. To minimize the chance that the statistical findings captured phenomena unique to Enron, Gilbert and his team manually went through the emails and removed phrases that appeared specific to the company. They also threw out all correspondence after Enron’s bad behavior had come to light.
“The trick,” he said, “was to find a point just before things really started to fall apart. We chose a date (May 1, 2001) several months before a formal investigation of Enron began and before its executives started selling off their own Enron stock. For all the emails that preceded that date, it’s reasonable to assume that Enron and its people behaved just like any other large private entity.”
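The date cutoff Gilbert describes amounts to a simple filter over the corpus. The sketch below shows one way it might look; the message dictionaries, the `sent` field, and the `before_cutoff` helper are hypothetical and do not reflect the actual layout of the Enron corpus or the team's code.

```python
from datetime import date

CUTOFF = date(2001, 5, 1)  # cutoff date quoted in the article


def before_cutoff(messages):
    """Keep only messages sent strictly before the cutoff date."""
    return [m for m in messages if m["sent"] < CUTOFF]


# Made-up sample records, for illustration only.
sample = [
    {"id": 1, "sent": date(2000, 11, 3)},
    {"id": 2, "sent": date(2001, 5, 1)},
    {"id": 3, "sent": date(2001, 10, 22)},
]
kept = before_cutoff(sample)
```

Only the first sample record survives, since the filter keeps messages that preceded May 1, 2001 and drops anything sent on or after that date.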
Gilbert’s research will be presented today at the 2012 ACM Conference on Computer Supported Cooperative Work (CSCW), being held Feb. 11-15 in Seattle. Gilbert said CSCW, now in its 15th year, is widely regarded as the first conference devoted to social computing.
To view the full 100-word lists that reliably predict hierarchical direction, read Gilbert’s paper at http://comp.social.gatech.edu/papers/cscw12.hierarchy.gilbert.pdf. And about that certain four-letter word that rhymes with “hit”?
“Yeah, I don’t know what the story on the cursing is,” Gilbert said. “Maybe that’s the next paper.”
About the Georgia Tech College of Computing
The Georgia Tech College of Computing is a national leader in the creation of real-world computing breakthroughs that drive social and scientific progress. With its graduate program ranked 10th nationally by U.S. News and World Report, the College’s unconventional approach to education is defining the new face of computing by expanding the horizons of traditional computer science students through interdisciplinary collaboration and a focus on human-centered solutions. For more information about the Georgia Tech College of Computing, its academic divisions and research centers, please visit http://www.cc.gatech.edu.