Using data drawn from queries entered into Google, Microsoft and Yahoo search engines, scientists at Microsoft, Stanford and Columbia University have for the first time been able to detect evidence of unreported prescription drug side effects before they were found by the Food and Drug Administration's warning system.
Using automated software tools to examine queries by 6 million Internet users taken from Web search logs in 2010, the researchers looked for searches relating to an antidepressant, paroxetine, and a cholesterol-lowering drug, pravastatin. They were found evidence that the combination of the two drugs caused high blood sugar.
The study, which was reported in the Journal of the American Medical Informatics Association on Wednesday, is based on data-mining techniques similar to those employed by services like Google Flu Trends, which has been used to give early warning of the prevalence of the sickness to the public.
The FDA asks physicians to report side effects through a system known as the Adverse Event Reporting System, but its scope is limited by the fact that data is generated only when a physician notices something and reports it.
The new approach is a refinement of work done by the laboratory of Russ B. Altman, the chairman of the Stanford bioengineering department. The group had explored whether it was possible to automate the process of discovering "drug-drug" interactions by using software to hunt through the data found in FDA reports.
The group reported in May 2011 that it was able to detect the interaction between paroxetine and pravastatin in this way. Its research determined that the patient's risk of developing hyperglycemia was increased compared with taking either drug individually.
The new study was undertaken after Altman wondered whether there was a more immediate and more accurate way to gain access to data similar to what the FDA could get.
He turned to computer scientists at Microsoft, who created software for scanning anonymized data collected from a software toolbar installed in web browsers by users who permitted their search histories to be collected. The scientists were able to explore 82 million individual searches for drug, symptom and condition information.