notes

Personal notes
git clone git://git.laack.co/notes.git
Log | Files | Refs

YaCy.md (972B)


      1 # YaCy
      2 
      3 **Source:** [https://yacy.net/](https://yacy.net/)
      4 
      5 ## Summary
      6 
      7 YaCy is a distributed search engine with P2P and crawling functionality, allowing for the indexing of internal, external resources, and the sharing thereof.
      8 
      9 ## Elements
     10 
     11 ### Crawler
     12 
     13 The crawler can crawl specified pages, branch out from pages, and parse relevant content. This is where the decentralization power comes from because each node can crawl what it wants to. 
     14 
     15 ### Indexer
     16 
     17 The indexer creates and inverted index for crawled pages. 
     18 
     19 ### Administration
     20 
     21 The administration functionality allows for the management of indexed urls, cleanup, resource limiting, and other general administration functionality.
     22 
     23 ### P2P Functionality
     24 
     25 YaCy uses a distributed hash table to allow nodes to discover each other. To allow nodes to contribute to search results, a node may query other nodes for index segments which are then merged together on the local node to give the finalized search results.