Skip to content

Instantly share code, notes, and snippets.

@mohdsanadzakirizvi
Created August 7, 2019 02:34
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save mohdsanadzakirizvi/fc1eabbf24ce2f804a8044b43f84aff0 to your computer and use it in GitHub Desktop.
Save mohdsanadzakirizvi/fc1eabbf24ce2f804a8044b43f84aff0 to your computer and use it in GitHub Desktop.
# create a character mapping index
chars = sorted(list(set(data_new)))
mapping = dict((c, i) for i, c in enumerate(chars))
def encode_seq(seq):
sequences = list()
for line in seq:
# integer encode line
encoded_seq = [mapping[char] for char in line]
# store
sequences.append(encoded_seq)
return sequences
# encode the sequences
sequences = encode_seq(sequences)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment