Created
April 5, 2021 19:21
-
-
Save avriiil/a23ba9304c2af593f62186b847658fde to your computer and use it in GitHub Desktop.
Remove diacritics from Arabic text
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# import the dediacritization tool | |
from camel_tools.utils.dediac import dediac_ar | |
# apply to your text column | |
df.tweet_text = df.tweet_text.apply(dediac_ar) |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
what this "df"?