CastorVH - Data Gathering Challenge

Register
Submit a solution
The challenge is finished.

Challenge Overview

Challenge Details
  • Write code which will extract the college names in USA with the details provided in the scope.
  • Persist the extracted data in a database table

Project Background

Customer is a global healthcare company, they research, develop and manufacture consumer healthcare products. The purpose of this project to help find the universities and colleges that have the mandatory Seasonal Flu vaccination and also find universities/colleges that just recommend it.

Technology Stack
  • Python is preferred, however nodejs or java can be used for data extraction script.
  • The data can be persisted in an MS SQL Server 2017.

Individual Requirements

Website Scrapper
  • Gather list of colleges in USA. Starting point can be NCAA list or Wikipedia.
  • There are around 360 colleges in the list and the following details need to be extracted
    • Name
    • Address
    • State
    • Zipcode
    • URLs
Deployment Guide and Validation Document

Make sure to require two separate documents for validation.

A README.md that covers:
  • Deployment - that covers how to build and test your submission.
  • Configuration - make sure to document the configuration that are used by the submission.
  • Dependency Installation -  should clearly describe the step-by-step guide for installing dependencies and should be up to date.
A Validation.md that covers:
Validation of each requirement can be mentioned in this document which will be easier for reviewers to map the requirements with your submission.

Final Submission Guidelines

  • Submit your source code as zip file

ELIGIBLE EVENTS:

Topcoder Open 2019

Review style

Final Review

Community Review Board

Approval

User Sign-Off

ID: 30072559