BETA
This is a BETA experience. You may opt-out by clicking here

More From Forbes

Edit Story

AI Finally Gives Everyone A Voice... Whether We Like It Or Not!

Following

Back in Ancient Greece civic leaders could easily and directly consult the views of people in public spaces like the Agora. So too in Rome, the Forum provided a place to canvas diverse perspectives on issues from new ideas, laws and commercial propositions.

Surveys and focus groups replace the forum but with limitations

While the Agora and Forum worked well previously, with the growth of much larger cities and a move to written culture, finding out what a large number of people think has become more challenging. Since the 1960s two main strategies have evolve to take their place: you can either find a small but hopefully representative and diverse subset of people — a focus group and use their views to represent those of many others like them or, alternatively, you can ask lots of people via a poll with a simplified phone or online questionnaire which surveys views on specific questions.

Focus groups try to understand the nature of preferences, and their causes and as well as seeking unsolicited views and unstructured responses whereas Surveys and Polling is often used in quantitative research to try and find answers to specific questions from lots of people and thus predict things like who will likely win the next election, what the economy wide consumer sentiment is like and so on. While we know both of these have benefits, insiders also know each also has major flaws and neither is particularly democratic. If you’re influential and an outlier in your views, your views are not likely to be well represented in focus groups and if the issues are complex and types of responses are not immediately obvious, important details may be missed by the crude instrument of the survey.

What if you could combine both approaches and seek input from large numbers of people in open and unstructured ways that suit them and yet still be in a position to make sense of it once the results are in and way to ensure that each and everyone's viewpoints are considered when analyzing the results.

A new approach offers the best of both worlds

New techniques in data science provide a way for Governments, Businesses and NGOs to analyze very large numbers of written public submissions, transcripts from public hearings and statements made in public social media on wide-ranging topics and to be able to make better sense of them all in a structured and fair way.

Earlier this year Australia faced some of the worst floods it has faced in recorded history in Sydney as well as up and down the Eastern Coast coast of Australia. Thirteen people died and 4000 homes were destroyed in New South Wales alone. In response, the State Government established an independent inquiry to examine the causes, consequences and impacts of this devastating natural disaster and how better to plan for likely future natural disasters.

The New South Wales Flood Inquiry led by former Police Commissioner Michael Fuller and former Chief Scientist Mary O'Kane has now published its findings in full. As part of the process, the inquiry received over 1,400 detailed submissions from the public, businesses and emergency services organisations impacted by the flooding.

My colleagues and I at League of Scholars assisted the inquiry with independent analytics using a variety of data science techniques including machine-learning-based topic modelling to “read all the submissions” and reveal that within the 1400 submissions there were six key distinct themes. Each of the submissions could be categorized into one of these six themes or topics namely: Homes & Family (698); Water Engineering (414); Emergency Services (191); Planning in light of Climate Change (78); Recovery (54) and the Environment (15).

Further analysis of submissions by location revealed which themes were most relevant for different communities and in different towns. Furthermore, the process identified the most representative submission from each theme, so you can appraise and get a good understanding of the key issues by reading half a dozen of these submissions that are representative of many hundreds of others like them. As well, we tested the level of originality in each submission to see whether there was any plagiarism or campaigning where the same submission was sent by multiple people or cut and pasted from the internet. In this case, nearly all submissions were highly original and so there wasn’t any evidence for plagiarism found.

As well, we conducted detailed analytics of flood-related social media — over 50,000 public posts on Twitter that were made during the floods and subsequent cleanup efforts and that chronicled the unfolding of events and which news services people most turned to as revealed through link sharing patterns.

A comprehensive analysis of search traffic data, found more than one hundred search phrases whose frequency was directly correlated with a Rainfall index associated with the Flooding — some of these were things people searched for during the floods themselves and others were correlated but with a lag or lead of a number of days. So, for example, seven days after the flooding many people were searching for information about landslides and mudslides indicating the risks associated with aftershocks.

A summary of the full findings on submissions, social and search analytics has now been published as part of the impressive Flood Inquiry three-volume report in the appendix.

These approach taken here combines many of the benefits of large-scale quantitative research initiatives (accurate, representative, large sample sizes, enables predictive analysis) as well as those of qualitative research (enabling responses to be unsolicited, unbiased and unlimited in their scope and can facilitate long and subtle responses to complex issues) into a more unbiased and balanced way to understand many peoples diverse views on complex, contentious or subtle topics.

New data science techniques like the ones illustrated above offer Governments, NGOs and companies new ways to analyze large-scale collections of public submissions and thus invite more input from lots of people using a more open-ended and unlimited process that can still be distilled into actionable insights to improve policy, processes and products.

Follow me on Twitter or LinkedInCheck out some of my other work here