- Place: Opennebula cluster unine.ch
- Date: 11.5.2013
- Wikipedia-languages: ar,af,be
- worker-vms: 2
- crawlers (url-blocks): 4
- hashing: simple (md5 on whole url)
- virtual latency: none
##Seed urls
##worker-vm
- cpu: 1 vcpu, 2cpus
- ram: 2048
- details: ./vms/lshw_worker_001.txt
##results notes:
- expected pages:
find articles/ -type f | wc -l
- crawled pages: everything including errors (404, ...)
expected pages: 362'748
crawled pages: 397'406
crawl started:
crawl ended:
crawl duration:
pages / sec:
##crawler config https://gist.github.com/cederigo/7317124