Accumulo supports building a set of sample data that can be efficiently accessed by scanners. What data is included in the sample set is configurable. Below, some data representing documents are inserted.
root@instance sampex> createtable sampex
root@instance sampex> insert 9255 doc content 'abcde'
root@instance sampex> insert 9255 doc url file://foo.txt