Research: Intelligence and Corporate Intelligence / Media Analysis / Industry Monitoring / Brand Mining / Public Perception / Social Media / Event Mining
I am the chief architect of the largest open source intelligence project in academia, leveraging commercial, governmental, and declassified intelligence products to produce global databases of human activity across multiple disciplines over time. One project involves cataloging all "societal stability" events (riots, assassinations, protests, etc) across the world from 1946 to present using tens of millions of local news reports captured and translated from the local presses of each nation. I was also the chief architect of the NCSA VIAS project, one of the early "web-scale" industry monitoring systems. I am founder of the Carbon Capture Report, the premier global climate change news analytics service. I have worked extensively on a wide range of corporate intelligence, event mining, and media monitoring initiatives, especially issues such as trend mining and public perception.
- An Open Source Study of International Media Coverage of the WorldCom Scandal. Journal of International Communication, Vol 14, Issue 2. (2008).
- The use of data mining methods to evaluate public interest in carbon sequestration. Fifth Annual Conference on Carbon Capture and Sequestration, Alexandria Mark Center, Alexandria, Virginia, May 8-11, 2006. (Poster presentation on CD-ROM proceedings as Poster 075).
- Automated Crisis Mapping Talk. An overview of my work on global news mining of societal stability events.
MEDIA ANALYSIS / INDUSTRY MONITORING / BRAND MINING / PUBLIC PERCEPTION
I have worked extensively on a wide array of media analysis, brand mining, industry monitoring, and public perception problems, including numerous "grand challenge" problems.
- Public Perception. Public perception monitoring is the use of sophisticated perceptual models of human stimuli response to model the overall reaction and perception of a given topic in the mainstream and social media spheres. I have worked extensively on public perception monitoring for a wide range of organizations and topical areas, including areas such as [sentiment analysis].
- Brand Mining. I've worked on numerous "brand mining" initiatives to develop realtime and broad-scoped metrics of how a particular corporate identity or brand is being perceived and portrayed in the mainstream and social media spheres.
- Industry Monitoring. Brand mining is ideal for flagship brands, but most large corporations have hundreds or even thousands of distinct brands, making it difficult to see the company's overall performance in the marketplace. More importantly, trends in a single brand are not meaningful without the surrounding context of the overall marketplace's performance. For example, a monitoring system tracking an automobile brand 2000-2009 would note that coverage in 2009 was more subdued and had much less volume than 2000. This metric by itself would be misleading, however, since the automobile industry as a whole has fallen in coverage, and an individual brand's performance must be normalized by this larger shift. Industry monitoring captures the state of an entire industry and allows such comparisons.
- Competitive and Risk Analysis. Industry trends and public perception patterns are all critical components in the risk analysis of new product launches and evolving brands. Robust insights into competitive risk are only possible with a detailed understanding of media flow and interaction patterns and the proper integration of monitoring technologies.
One of my current signature initiatives is the Carbon Capture Report, a global news and social media monitoring and public perception analytics platform for studying global patterns in climate change coverage. The system continually monitors the mainstream and social media spheres, including the blogosphere, Twitter, YouTube, and social bookmarking sites, in realtime, performing "crowdsourcing", advanced geographic intelligence analysis, data mining and analytics, and compiling autonomous bibliographies of all people and companies mentioned in the news and their relationships. The core underlying technology is easily adapted to other industries or topical areas and a wide range of more sophisticated analytical services are available.
VIASI was the chief architect and technical director of the NCSA VIAS project. VIAS is an award-winning domain-specific information retrieval, archival, and processing system. It has been at the forefront of a number of Fortune 50 corporate intelligence initiatives and deployed extensively in data mining and intelligence applications for corporate, academic, and governmental clients over the better part of a decade. A VIAS Dynamic Knowledge Repository (DKR) is seeded with a set of topic words, URLs, and other information that define the operational parameters of its topical area. VIAS then monitors all known mailing lists, USENET groups, available wirefeeds, and other streaming sources, as well as autonomously crawls the web using a federation of directed web crawlers to collect information on this topic. All information is screened for relevance against the topical parameter space and then undergoes a series of metadata extraction and production processes, where filters identify people, company, and organization names, bibliographic references, and acronym resolutions, together with higher-order processing such as summary abstract generation. An advanced analytics and visualization suite offers a range of features based on this generated metadata, such as temporal entity graphs and relationship mining.
- VIAS: A Pathway to Knowledge.
- VIAS: A Pathway to Knowledge: compiling continually updated categorized information on dynamic topics.
SOCIAL MEDIA
I've worked extensively on the intersection of the social and mainstream media spheres, including diffusion across geographic and typographic borders. I've developed a large array of robust fully automated analytical tools that have been applied in a number of large intelligence and media monitoring initiatives, including advanced crowdsourcing platforms.
- New media vs old media: A portrait of the Drudge Report 2002-2008. First Monday. Vol. 14, Issue 7. (July 6, 2009). Cover article of issue. Covered in several hundred blogs and independent media outlets, along with:
- Columbia Journalism Review (CJR Editors, November 10, 2009). Covers report by Greg Beato using the study's data.
- Columbia Journalism Review (Drudge Has Lost His Touch, by Ethan Porter, Sep/Oct 2009 Feature)
- Gawker: How Reuters Underwrote Andrew Breitbart's Budding Right-Wing Web Empire
- Politico
- Gawker: Matt Drudge By the Numbers
- Gawker: The Drudge Report Power List
- The New York Observer
- KFWB All News Radio in Los Angeles
- IMDB (The Internet Movie Database)
- The Wrap
EVENT MINING
I am the chief architect of the Social Political Economic Event Database (SPEED) at the University of Illinois Cline Center for Democracy, an effort to compile a global event database covering 172 countries from 1946-present. The resulting database will consist of all major human events in those categories occurring anywhere on earth in the past 60 years, offering an unprecedented view into the underpinnings of societal evolution. More than 1,200 variables are captured for each event, including date and location to city or landmark resolution. A combination of machine and human processes are used to automate much of the heavy lifting of event coding, while still allowing for the sophistication of interpretation afforded by human analysts. The combined event dataset takes the form a vast network of human activity, with each event being a node and all 1,200 variables acting as links connecting those events. Predictive analytics and "scenario modeling" can ultimately use this event network to forecast sets of potential outcomes for emerging conflict across the globe.- Automated Crisis Mapping Talk. An overview of my work on global news mining of societal stability events.
- Global Entrepreneurship Event Database Project: AEL Grant Final Report. I conducted a small grant-funded pilot study in 2009 for the Academy of Entrepreneurial Leadership at the University of Illinois (AEL) on the compilation of a global database of entrepreneurship events from US newspaper content. (August 13, 2009).
WHITEPAPERS
- An Analytical Partial Content Analysis Bibliography: 1938-2008. Released March 18, 2008, this project was an attempt to catalog historical books written on content analysis and offer a rough catagorization of them by major focus area.
- Agenda Setting: An Overview. An overview of agenda setting that paints a brief evolutionary history of the field using the annotated reading list below. (June 18, 2009).
- Agenda Setting: An Annotated Reading List. An annotated reading list of key developments in the history of agenda setting theory. (June 18, 2009).
- Book Review: The Myth of Media Globalization (Kai Hafez, 2007). A brief review of Kai Hafez's 2007 book on media globalization. (November 4, 2009).
- Language Use in Advertising: An Analysis of Linguistic Features Across Readership Domains. (May 9th, 2001).