wondering if a screenshot generated on the user side (i imagine a chrome extension that a webarchivist uses to nominate seeds, and something like https://developer.chrome.com/extensions/tabs#method-captureVisibleTab to capture the image) could be compared with a screenshot generated server side (phantomjs or similar) to spot differences and improve quality of the archive
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
### Keybase proof | |
I hereby claim: | |
* I am atomotic on github. | |
* I am atomotic (https://keybase.io/atomotic) on keybase. | |
* I have a public key whose fingerprint is 84EF B4A9 3159 78FC 98C6 BED1 7BEC E8BD A634 1C49 | |
To claim this, I am signing this object: |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
package EPrints::Plugin::Export::DEPOSITOLEGALE; | |
# eprint needs magic documents field | |
# documents needs magic files field | |
use EPrints v3.3.0; | |
use EPrints::Plugin::Export::XMLFile; | |
@ISA = ( "EPrints::Plugin::Export::DEPOSITOLEGALE" ); |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/usr/bin/env bash | |
declare -A maps | |
maps=( | |
["osm-emr"]="http://geodati.fmach.it/gfoss_geodata/osm/output_img/emilia-romagna.tar.gz" | |
["osm-vdo"]="http://geodati.fmach.it/gfoss_geodata/osm/output_img/valle-aosta.tar.gz" | |
["osm-piemonte"]="http://geodati.fmach.it/gfoss_geodata/osm/output_img/piemonte.tar.gz" | |
["osm-trentino"]="http://geodati.fmach.it/gfoss_geodata/osm/output_img/trentino-alto-adige.tar.gz" | |
["osm-basilicata"]="http://geodati.fmach.it/gfoss_geodata/osm/output_img/basilicata.tar.gz" |
"Al fondo, vedo (non certo nei curatori del progetto, ma come sottofondo ancora presente nella mente di alcuni bibliotecari italiani), l'idea che le opere conservate in una biblioteca siano in qualche modo "possedute" dalla biblioteca. Conservare (che è la mission delle biblioteche per le opere storiche) non implica acquisire diritti, quanto piuttosto, semmai, doveri, come appunto quello della massima valorizzazione delle opere. Al giorno d'oggi questo non può che significare rilasciare anche le digitalizzazioni in pubblico dominio (o CC0)."
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
package main | |
import ( | |
"bufio" | |
"bytes" | |
"fmt" | |
"github.com/richardlehane/siegfried" | |
"github.com/slyrz/warc" | |
"io/ioutil" | |
"log" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/bin/bash | |
. heritrix.conf | |
if [ -z "$1" ] || [ -z "$2" ]; then | |
echo usage: $0 jobname seedsfile | |
exit | |
fi | |
JOB=$1 |
add to your .zshrc/.bashrc this function (jq is required)
function ia-latest() { curl -s http://archive.org/wayback/available\?url=$* | jq -r '.archived_snapshots.closest.url' }
run
$ ia-latest http://twitter.com/atomotic
http://web.archive.org/web/20131230143739/https://twitter.com/atomotic
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
source "http://rubygems.org" | |
gem "oai", :git => "https://github.com/tjdett/ruby-oai.git", :branch => "seamless-resumption" | |
gem "redis" | |
gem "libxml-ruby" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
$ pip install warcprox | |
$ brew install phantomjs | |
$ warcprox -c warcprox.pem | |
$ phantomjs --proxy=localhost:8000 \ | |
--ssl-certificates-path=warcprox.pem \ | |
/usr/local/Cellar/phantomjs/2.0.0/share/phantomjs/examples/rasterize.js \ | |
http://{URL} screenshot.png "1024px*768px" |