@pramsey
Created July 18, 2016 21:34
blocking...
creating blocking_map database
creating inverted indexes
writing blocking map
Traceback (most recent call last):
  File "1_identify_address_entities.py", line 208, in <module>
    csv_writer.writerows(b_data)
  File "/Library/Python/2.7/site-packages/dedupe/blocking.py", line 42, in __call__
    block_keys = predicate(instance)
  File "/Library/Python/2.7/site-packages/dedupe/predicates.py", line 224, in __call__
    for predicate in self]
  File "/Library/Python/2.7/site-packages/dedupe/predicates.py", line 66, in __call__
    return self.func(strip_punc(column))
  File "/Library/Python/2.7/site-packages/dedupe/predicates.py", line 23, in strip_punc
    s = s.encode('utf-8').translate(None, PUNCTUATION)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 0: ordinal not in range(128)
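
One likely reading of the traceback: in Python 2, calling `.encode('utf-8')` on a byte `str` first decodes it implicitly with the ASCII codec, so any field that already holds UTF-8 bytes (0xc2 is the lead byte of a two-byte UTF-8 sequence) fails before the encode even starts. The sketch below assumes the record fields reach dedupe as UTF-8 byte strings and decodes them to text beforehand. `to_text` and `clean_record` are hypothetical helpers for illustration, not part of dedupe, and the sample address is made up.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function, unicode_literals


def to_text(value, encoding="utf-8"):
    """Return `value` as text, decoding byte strings explicitly (hypothetical helper)."""
    if isinstance(value, bytes):
        return value.decode(encoding)
    return value


def clean_record(record, encoding="utf-8"):
    """Decode every byte-string field of a record dict (hypothetical helper)."""
    return {key: to_text(val, encoding) for key, val in record.items()}


# Made-up sample: b"\xc2\xa0" is a UTF-8 non-breaking space in front of the text.
raw = b"\xc2\xa0123 Main St"

# Decoding as ASCII fails on the same byte the traceback reports.
try:
    raw.decode("ascii")
    ascii_error = None
except UnicodeDecodeError as exc:
    ascii_error = str(exc)

# Decoding as UTF-8 up front yields text that can be encoded again safely.
cleaned = clean_record({"address": raw})
print(ascii_error)
print(repr(cleaned["address"]))
```

If the input is not actually UTF-8, the `encoding` argument would need to match the real source encoding; that is an assumption to check against the data.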