Johnson & Johnson | Logo

Research Publications Intelligence

Accelerating pharma market insights using public data and precision scraping

JJ | Tags
JJ | Hero Image Asset
JJ | Hero Image Asset
Johnson & Johnson | Logo

Research Publications Intelligence

Accelerating pharma market insights using public data and precision scraping

JJ | Tags

 

 

Johnson & Johnson The Challenge Image

The Challenge

A Top 500 pharma company needed a scalable and reliable way to collect actionable competitive intelligence from a vast pool of publicly available research publications. The goal was to identify how 13 major pharmaceutical competitors were investing in key areas like therapeutic innovation, disease research, and data science strategies.


This required finding the right data sources, extracting relevant content, identifying competitor mentions, and capturing it all in a structured, machine-readable format. The biggest hurdles were data quality, trustworthiness of sources, balanced coverage across all competitors, and the ability to replicate the process through clean, documented code.

 

 

The Solution

Through Topcoder’s Innovation Challenge model, participants were tasked with identifying trustworthy sources, scraping research content, and tagging it with competitor-specific insights.


The solution included a structured dataset of over 120 submissions formatted in JSONL, featuring metadata, publication dates, and relevance snippets for 13 major pharmaceutical competitors. Submissions were built using Docker and scored based on the depth of insights, coverage consistency across all competitors, and the ability to surface emerging trends in therapeutic innovation and strategy.

Challenge we ran:


Innovation Series IC6 - Pharma Competitor Insight Quest - Data extraction from Research Publications (1/7)

 

9

Challenges

77

Participants

127

Submissions

 

JJ | The Impact Image asset

The Impact

The customer gained a reproducible and scalable framework for competitive intelligence, reducing the time and manual effort typically required for this kind of research. The final dataset helped uncover patterns in competitor activity, spotlighted emerging research trends, and enabled faster, data-backed decision-making.

By outsourcing the data extraction task to Topcoder’s global community, the customer accessed a broader pool of technical expertise, which resulted in a more balanced and comprehensive result set than a traditional in-house effort might have achieved.

 

 

 

Achieve high-quality outcomes with

Topcoder.

Achieve high-quality outcomes with Topcoder.

 

Talk to an expert