Created July 18, 2016 21:34
blocking...
creating blocking_map database
creating inverted indexes
writing blocking map
Traceback (most recent call last):
  File "1_identify_address_entities.py", line 208, in <module>
    csv_writer.writerows(b_data)
  File "/Library/Python/2.7/site-packages/dedupe/blocking.py", line 42, in __call__
    block_keys = predicate(instance)
  File "/Library/Python/2.7/site-packages/dedupe/predicates.py", line 224, in __call__
    for predicate in self]
  File "/Library/Python/2.7/site-packages/dedupe/predicates.py", line 66, in __call__
    return self.func(strip_punc(column))
  File "/Library/Python/2.7/site-packages/dedupe/predicates.py", line 23, in strip_punc
    s = s.encode('utf-8').translate(None, PUNCTUATION)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 0: ordinal not in range(128)
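
A note on the last frame: in Python 2.7, calling `.encode('utf-8')` on a byte `str` first decodes it implicitly with the ASCII codec, so a value that already holds UTF-8 bytes (here starting with `0xc2`) fails before any encoding happens. The sketch below reproduces that decode failure and shows one possible workaround: decode explicitly, then strip punctuation. It is only an illustration, not dedupe's code. `strip_punc_safe` is a hypothetical name, the sample value is made up, and `string.punctuation` stands in for dedupe's `PUNCTUATION` constant, whose actual contents are not shown in the traceback.

```python
import string

# Assumption: stand-in for dedupe's PUNCTUATION, whose real value is not
# visible in the traceback.
PUNCTUATION = string.punctuation


def strip_punc_safe(s, encoding="utf-8"):
    """Hypothetical variant of strip_punc that tolerates UTF-8 byte strings."""
    # Decode explicitly so non-ASCII bytes never go through an implicit
    # ASCII decode.
    if isinstance(s, bytes):
        s = s.decode(encoding)
    # Drop every character that appears in PUNCTUATION.
    return "".join(ch for ch in s if ch not in PUNCTUATION)


# Made-up sample: UTF-8 bytes whose first byte is 0xc2, as in the traceback.
raw = b"\xc2\xa0123 Main St., Apt. #4"

# Reproduce the decode step that Python 2 performs implicitly.
try:
    raw.decode("ascii")
    error_message = None
except UnicodeDecodeError as exc:
    error_message = str(exc)

cleaned = strip_punc_safe(raw)
```

An alternative along the same lines would be to decode each field to text when reading the input, so that the predicates never receive raw bytes in the first place.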