Skip to content

Instantly share code, notes, and snippets.

@devsergiy
Created March 12, 2012 18:15
Show Gist options
  • Save devsergiy/2023743 to your computer and use it in GitHub Desktop.
Save devsergiy/2023743 to your computer and use it in GitHub Desktop.
#place this in config/locales/translit.rb
# encoding: utf-8
# I18n transliteration delegates to Russian::Transliteration (we're unable
# to use common I18n transliteration tables with Russian)
#
# Правило транслитерации для I18n использует Russian::Transliteration
# (использовать обычный механизм и таблицу транслитерации I18n с
# русским языком не получится)
{
:ru => {
:i18n => {
:transliterate => {
:rule => lambda { |str| Transliteration.transliterate(str) }
}
}
}
}
#and this into libs with file name transliteration.rb
#don't forget to enable autoloading files from libs by adding next line into application.rb
#config.autoload_paths += %W(#{config.root}/lib)
# encoding: utf-8
# Russian transliteration
#
# Транслитерация для букв русского алфавита
module Transliteration
extend self
# Transliteration heavily based on rutils gem by Julian "julik" Tarkhanov and Co.
# <http://rutils.rubyforge.org/>
# Cleaned up and optimized.
LOWER_SINGLE = {
"і"=>"i","ґ"=>"g","ё"=>"yo","№"=>"#","є"=>"e",
"ї"=>"yi","а"=>"a","б"=>"b",
"в"=>"v","г"=>"g","д"=>"d","е"=>"e","ж"=>"zh",
"з"=>"z","и"=>"i","й"=>"y","к"=>"k","л"=>"l",
"м"=>"m","н"=>"n","о"=>"o","п"=>"p","р"=>"r",
"с"=>"s","т"=>"t","у"=>"u","ф"=>"f","х"=>"h",
"ц"=>"ts","ч"=>"ch","ш"=>"sh","щ"=>"sch","ъ"=>"'",
"ы"=>"y","ь"=>"","э"=>"e","ю"=>"yu","я"=>"ya",
}
LOWER_MULTI = {
"ье"=>"ie",
"ьё"=>"ie",
}
UPPER_SINGLE = {
"Ґ"=>"G","Ё"=>"YO","Є"=>"E","Ї"=>"YI","І"=>"I",
"А"=>"A","Б"=>"B","В"=>"V","Г"=>"G",
"Д"=>"D","Е"=>"E","Ж"=>"ZH","З"=>"Z","И"=>"I",
"Й"=>"Y","К"=>"K","Л"=>"L","М"=>"M","Н"=>"N",
"О"=>"O","П"=>"P","Р"=>"R","С"=>"S","Т"=>"T",
"У"=>"U","Ф"=>"F","Х"=>"H","Ц"=>"TS","Ч"=>"CH",
"Ш"=>"SH","Щ"=>"SCH","Ъ"=>"'","Ы"=>"Y","Ь"=>"",
"Э"=>"E","Ю"=>"YU","Я"=>"YA",
}
UPPER_MULTI = {
"ЬЕ"=>"IE",
"ЬЁ"=>"IE",
}
LOWER = (LOWER_SINGLE.merge(LOWER_MULTI)).freeze
UPPER = (UPPER_SINGLE.merge(UPPER_MULTI)).freeze
MULTI_KEYS = (LOWER_MULTI.merge(UPPER_MULTI)).keys.sort_by {|s| s.length}.reverse.freeze
# Transliterate a string with russian characters
#
# Возвращает строку, в которой все буквы русского алфавита заменены на похожую по звучанию латиницу
def transliterate(str)
chars = str.scan(%r{#{MULTI_KEYS.join '|'}|\w|.})
result = ""
chars.each_with_index do |char, index|
if UPPER.has_key?(char) && LOWER.has_key?(chars[index+1])
# combined case
result << UPPER[char].downcase.capitalize
elsif UPPER.has_key?(char)
result << UPPER[char]
elsif LOWER.has_key?(char)
result << LOWER[char]
else
result << char
end
end
result
end
end
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment