Last active
November 23, 2016 16:00
-
-
Save sharonhe/2360692ec8d9d0404ee89565d2bb28fa to your computer and use it in GitHub Desktop.
Trying to figure out text encoding
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
What's the difference between the following two lines? The first one is copied from a webpage and the second one is typed out. | |
None of my text editors understand the first one...I want some way to go from the first line to the second, without having to type it out again. | |
𝙶𝙶𝙲𝙶𝙲 | |
GGCGC | |
Thanks a lot, this works! (And yeah, GitHub wont let me type those characters in a comment, but somehow they work in the gist..
$ echo "GGCGC" | iconv -f utf-8 -t ascii//translit
GGCGC
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
And you could something like the following to convert whole files
iconv -f utf-8 -t ascii//translit originalfile > newfile
Where original file is the filename with the weird chars and newfile is a new file to which you would like to output.