Challenge Overview

Challenge Overview

Given a set of interest tags (e.g., machine learning), search the whole Internet (using Google and/or other resources) for likely matches (i.e., URLs), and assign a match probability to each URL.

Background

Open Ecosystem Network is an open co-creation environment. The goal is to bring together the most ambitious companies and brightest individuals to help them innovate and grow their businesses bigger, better and faster. The solution which you are going to build should help us in this direction to find interesting leads and be able to reach out to them.

Task Detail

The interest tags are mainly used to describe a project. Please refer to https://www.open-ecosystem.org to understand the context and background to this requirement.
 
In this task, given a set of interest tags, you are asked to design a method, which can retrieve web pages related to this set from the whole Internet, and then extract the useful information from these web pages, such as company/person names, contact email, and contact number. The final output is expected to be a ranked list of URLs and their extracted information.
 
During the search/retrieval process, you are allowed to use other (paid) resources. For example, you may want to use Google search as a tool. Please note that, some websites, like LinkedIn, they may have more structured data, however, they cannot cover the whole Internet.
 

Examples

Here are examples of a 100% Successful Response to Intelligent Web Crawler search Query.
 
Ideal Response 1
Search Query Parameter: Innovation, Network, Platforms

Entity Name: Ideator
Product/ Services: Innovation Software, Enterprise Networking Software
Contact Name: Greg Schutz
Contact Designation: Head of Business Development & Sales
Contact Email: info@ideator.com, greg.s@ideator.com
Phone: +1-415-806-7865
Website: https://ideator.com/
 
Other contacts:
Facebook: https://www.facebook.com/ideatorinc
LinkedIn: https://www.linkedin.com/company/ideator-com
Twitter: https://twitter.com/ideatorinc
 
Location: San Francisco, California, United States
 
Synopsis of Services:
Ideator helps you find co-founders or team members, connect with advisors and mentors, track your goals, and collaborate with other innovators, Improve engagement for your innovation or entrepreneurship program, organize competitions, source new ideas, increase revenue, and foster collaboration, Get real-time visibility into the success of your program with data, analytics, and the key metrics that are driving your innovation or entrepreneurship program
 
Sources:
https://ideator.com/
https://www.linkedin.com/in/gregthomasschutz/
https://www.crunchbase.com/organization/ideator-inc#section-overview
https://www.bloomberg.com/research/stocks/private/snapshot.asp?privcapId=415368599
 
----------------------------------------------------------------------------------------------------------------------
  
Ideal Response 2
Search Query Parameter: Artificial Intelligence, Sales Analytics
 
Entity Name: Zilliant
Product/ Services: Zilliant IQ Platforms, Machine Learning, predictive analysis for sales and B2B Customer relationships
Website: https://www.zilliant.com/
 
Contact 1
Contact Name: Tom Woddard;
Contact Designation: Business Development Director
Contact Email: info@ zilliant.com, tom.wod@zilliant.com
Phone: +1-908-806-7865,
 
Contact 2
Contact Name: Greg Artwater
Contact Designation: Regional Sales Manager EMEA
Contact Email: info@ zilliant.com, greg.artwater@zilliant.com
Phone: +1-908-806-5665
 
Other contacts:
Facebook: https://www.facebook.com/zilliantinc/
LinkedIn: https://www.linkedin.com/company/zilliant
Twitter: https://twitter.com/Zilliant
Google+ : https://plus.google.com/+ZilliantIncorporated
 
Location: Austin, Texas, United States
 
Synopsis of Services:
Zilliant IQ Engines use advanced AI, machine learning techniques, and sophisticated B2B predictive analytics to quantify the true economic potential of every customer, and intelligently prescribe actions that drive better revenue and margin outcomes.
 
Sources:
https://www.zilliant.com/
https://www.crunchbase.com/organization/zilliant
https://www.bloomberg.com/research/stocks/private/snapshot.asp?privcapId=93557
https://www.glassdoor.com/Overview/Working-at-Zilliant-EI_IE17000.11,19.htm
https://appexchange.salesforce.com/listingDetail?listingId=a0N30000009wKLZEA2

------------------------------------------------------------------------------------------------------------

Ideal Response 3
Search Query Parameter: Artificial Intelligence, Image Recognition, Analytics
 
Entity Name: Clarifai
Product/ Services: Machine Learning, Image recognition and classification, intelligent applications
Website: https://clarifai.com/
 
Contact 1
Contact Name: Matt Zeef;
Contact Designation: CMO
Contact Email: info@ clarifai.io, matt.z@clarifai.io
Phone: +1-908-806-7865,
 
Contact 2
Contact Name: Joe Blogs
Contact Designation: VP Sales
Contact Email: joe.b@clariai.io
Phone: +1-908-806-5665
 
Other contacts:
Facebook: https://www.facebook.com/Clarifai
LinkedIn: https://www.linkedin.com/company/clarifai
Twitter:  https://twitter.com/clarifai
Instagram : https://www.instagram.com/clarifai
 
Location: NYC, New York, United States
 
Synopsis of Services:
Experts in machine learning, Clarifai has been a market leader since winning the top five places in image classification at the ImageNet 2013 competition. Clarifai’s powerful image and video recognition solutions are built on the most advanced machine learning platform, and made easily accessible via API, device SDK, and on-premise, empowering businesses all over the world to build a new generation of intelligent applications..
 
Sources:
https://www.kdnuggets.com/2017/03/50-companies-leading-ai-revolution-detailed.html
https://www.crunchbase.com/organization/clarifai
 
Note: Having all kinds of information but no way to reach out to a contact, would defeat the purpose of this approach. An additional advantage would be to have an AI component to the software such that the subsequent Reponses produced would be based on the previous responses selected.  i.e. the AI learns the users' preference based on previously selected responses like Netflix.
 

We would say we need at least:

1.     Contact information (name & email)

2.     Company name

3.     Company URL

4.     Summary of offering



Final Submission Guidelines

Commercialization

This is a HARD requirement. You have to make sure your proposal can be used for the commercial purpose. If your solution involves any commercial license, please justify how can we purchase it and how much does it cost.

Final Submission

Format

  • A document should be minimum of 2 pages in PDF / Word format to describe your ideas.
  • It should be written in English.
  • Leveraging charts, diagrams, and tables to explain your ideas is encouraged from a comprehensive perspective.
  • PoC source files along with build and deployment steps

Judging Criteria

You will be judged on the quality of your ideas, the quality of your description of the ideas, and how much benefit it can provide to the client. The winner will be chosen by the most logical and convincing reasoning as to how and why the idea presented will meet the objective. Note that, this contest will be judged subjectively by the client and Topcoder. However, the judging criteria will largely be the basis for the judgment.
 
  1. Effectiveness (40%)
    1. How good is the solution (% of successful entries for a defined standard trial query)?
    2. Stability of solution (How reliable is the solution, does it return the required results consistently)
    3. The quality of the final ranked list is the first priority. Also, the extracted information should be as relevant as possible.
  2. Feasibility (60%)
    1. Please justify how general your proposed method is.
    2. Ongoing costs associated with the solution (lifetime costs, any costs relating to maintenance and licenses)
    3. Proof of concept (PoC) codes are required.
    4. MongoDB is preferred

Submission Guideline

You can submit at most TWO solutions, but we encourage you to include your great solution and details as much as possible in a single submission.

ELIGIBLE EVENTS:

2018 Topcoder(R) Open

Review style

Final Review

Community Review Board

Approval

User Sign-Off

ID: 30064313