EbayCrawler

EbayCrawler is a Spring Boot web crawler.

EbayCrawler crawls all the links found at a given URL and returns a tree-like data structure in which each node holds a URL, its HTTP status, and its child links.
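As a rough illustration of the tree described above, a node could look like the sketch below. The class name CrawlNode, its fields, and the size() helper are assumptions made for this example and are not taken from the project source.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical node type: the name and fields are illustrative assumptions,
// not the project's actual classes.
class CrawlNode {
    private final String url;
    private final int httpStatus;
    private final List<CrawlNode> children = new ArrayList<>();

    CrawlNode(String url, int httpStatus) {
        this.url = url;
        this.httpStatus = httpStatus;
    }

    // Attaches a child link found on this page.
    void addChild(CrawlNode child) {
        children.add(child);
    }

    String getUrl() {
        return url;
    }

    int getHttpStatus() {
        return httpStatus;
    }

    // Read-only view so callers cannot modify the tree by accident.
    List<CrawlNode> getChildren() {
        return Collections.unmodifiableList(children);
    }

    // Counts this node plus all of its descendants.
    int size() {
        int count = 1;
        for (CrawlNode child : children) {
            count += child.size();
        }
        return count;
    }
}
```

A root node for the crawled URL would then hold one child per link found on that page, and each child would hold its own links in turn.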

Prerequisites

Before you begin, ensure you have met the following requirements:

You have installed Java 8 or later.

Installing EbayCrawler

To install EbayCrawler, follow these steps:

Clone the project from GitHub.
Build it with Maven: mvn clean install

Using EbayCrawler

To use EbayCrawler, follow these steps:

Run the jar produced by the build.
Send a request to the running service with Postman.

Assumptions

For Task 3, "Improve the performance of your CrawlLinks API so it can support high crawlingDepth values (100 and more)": I added an in-memory cache that stores every visited link, so when a link is visited again its tree is retrieved from the cache instead of being crawled again.
For a more scalable solution, the visited links could be stored in a Redis database instead of in process memory.
Another solution, which would require more time, is to make the crawler multi-threaded.
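The caching idea can be sketched as follows. This is a minimal illustration under stated assumptions, not the project's implementation: the names CachedCrawler, Node, and linkFetcher are hypothetical, the HTTP status is left out for brevity, and page fetching is abstracted behind a function so the example runs without network access. The sketch caches a page's links per URL, so each page is fetched at most once, and caches subtrees per URL and remaining depth, so a repeated link reuses an already built subtree.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical sketch of a crawler with an in-memory cache (single-threaded).
class CachedCrawler {
    // Hypothetical result node: a URL plus the subtrees of its links.
    static final class Node {
        final String url;
        final List<Node> children = new ArrayList<>();

        Node(String url) {
            this.url = url;
        }
    }

    // Assumption: returns the links found on the page at the given URL.
    private final Function<String, List<String>> linkFetcher;
    // Links already fetched, keyed by URL.
    private final Map<String, List<String>> linksByUrl = new HashMap<>();
    // Subtrees already built, keyed by "<remaining depth> <url>".
    private final Map<String, Node> treeByDepthAndUrl = new HashMap<>();

    CachedCrawler(Function<String, List<String>> linkFetcher) {
        this.linkFetcher = linkFetcher;
    }

    // Builds the tree for url, following links up to depth levels down.
    Node crawl(String url, int depth) {
        String key = depth + " " + url;
        Node cached = treeByDepthAndUrl.get(key);
        if (cached != null) {
            return cached; // reuse the subtree instead of crawling again
        }
        Node node = new Node(url);
        if (depth > 0) {
            for (String link : linksOf(url)) {
                node.children.add(crawl(link, depth - 1));
            }
        }
        treeByDepthAndUrl.put(key, node);
        return node;
    }

    // Fetches the links of a page once and serves later requests from memory.
    private List<String> linksOf(String url) {
        List<String> links = linksByUrl.get(url);
        if (links == null) {
            links = linkFetcher.apply(url);
            linksByUrl.put(url, links);
        }
        return links;
    }
}
```

Keying the subtree cache on the remaining depth as well as the URL keeps a cached subtree from being shallower than the one requested. With pages that link back to each other, the work then grows with the number of distinct URLs times the depth rather than exponentially with the depth.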
