Linguistics Professor Victor Kuperman – mining the web for big data
The world’s largest collection of free data is right at our fingertips. But what’s to be done when the desired sample is so vast that it could take weeks, months or even years to collect and process through conventional Internet browsing? Enter McMaster assistant professor Victor Kuperman. Over the past year, Kuperman and two McMaster PhD candidates have been reading and studying roughly 1.8 million blogs and webpages from 340,000 websites around the globe using a fleet of high-powered computers in Togo Salmon Hall. The sites are personal, commercial and governmental in nature.