Cloud Based Google-like Search Machine Details

Built a cloud-based Google-style search engine from scratch using core Java libraries. Components include a dynamic web server, crawler, Spark-like data processing engine, indexer, PageRank, and ranker. Wemina, the search engine, suggests search queries based on the most popular searches, and updates them every time user clicks on the keyboard. Wemina is also able to display results for a subtle mispelled words (please refer to results for the "University of Pennsylvania" mispelled query, the last photo (#9)). Searches are based on 720K crowled pages, of which 350K were indexed. Deployed the project on AWS.