Created

Embed URL

HTTPS clone URL

SSH clone URL

You can clone with HTTPS or SSH.

Download Gist

Repairing a unicode string that contains invalid characters

View demo.rb
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
# encoding: utf-8
 
s = "Blah \xe9 blah 헌글"
 
puts "BEFORE"
puts "encoding: #{s.encoding}"
puts "valid : #{s.valid_encoding?}"
puts "text : #{s}"
 
s = s.
encode('utf-16le', 'utf-8',
:invalid => :replace,
:undef => :replace,
:replace => "#").
encode('utf-8')
 
puts "\nAFTER"
puts "encoding: #{s.encoding}"
puts "valid : #{s.valid_encoding?}"
puts "text : #{s}"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Something went wrong with that request. Please try again.