Marathi column data translation to english [on hold]
up vote
-3
down vote
favorite
I have some Marathi OwnerNames stored in my table, I want to translate it to English inside API.
I am using c# mvc API
c# sql sql-server asp.net-mvc
New contributor
put on hold as unclear what you're asking by Camilo Terevinto, HoneyBadger, Sami, Panagiotis Kanavos, Owen Pauling Nov 19 at 13:58
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
|
show 2 more comments
up vote
-3
down vote
favorite
I have some Marathi OwnerNames stored in my table, I want to translate it to English inside API.
I am using c# mvc API
c# sql sql-server asp.net-mvc
New contributor
put on hold as unclear what you're asking by Camilo Terevinto, HoneyBadger, Sami, Panagiotis Kanavos, Owen Pauling Nov 19 at 13:58
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
1
en.wikipedia.org/wiki/Marathi_language
– War
Nov 19 at 12:30
ASP.NET strings are Unicode strings.nvarchar
fields in SQL Server are Unicode fields as well. You don't have to convert anything to English. You definitely don't need to convert anything to UTF8 - that what ASP.NET itself does by default when it generates HTML.
– Panagiotis Kanavos
Nov 19 at 12:33
@SurentharP that linked question has nothing to do with this question. Besides, ASP.NET uses Unicode natively. So does SQL Server. No tricks or conversions are needed. StackOverflow is an ASP.NET site which is why I can write Αυτό Εδώ knowing it will be displayed without any problems
– Panagiotis Kanavos
Nov 19 at 12:34
1
@SurajS. as you see, I can pasteVarhadi (Varhādi) (वऱ्हाडी) or Vaidarbhi (वैदर्भी)
from the Wikipeda article and have it displayed in SO just fine. Are you asking how to transliterate the text using English characters ?
– Panagiotis Kanavos
Nov 19 at 12:39
Well, do you really need to translate the strings stored in that column in marathi lang to english lang or you have an issue to store marathi symbols to that column?
– Sami
Nov 19 at 12:41
|
show 2 more comments
up vote
-3
down vote
favorite
up vote
-3
down vote
favorite
I have some Marathi OwnerNames stored in my table, I want to translate it to English inside API.
I am using c# mvc API
c# sql sql-server asp.net-mvc
New contributor
I have some Marathi OwnerNames stored in my table, I want to translate it to English inside API.
I am using c# mvc API
c# sql sql-server asp.net-mvc
c# sql sql-server asp.net-mvc
New contributor
New contributor
edited Nov 19 at 12:49
Sami
6,47031038
6,47031038
New contributor
asked Nov 19 at 12:21
Suraj S.
13
13
New contributor
New contributor
put on hold as unclear what you're asking by Camilo Terevinto, HoneyBadger, Sami, Panagiotis Kanavos, Owen Pauling Nov 19 at 13:58
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
put on hold as unclear what you're asking by Camilo Terevinto, HoneyBadger, Sami, Panagiotis Kanavos, Owen Pauling Nov 19 at 13:58
Please clarify your specific problem or add additional details to highlight exactly what you need. As it's currently written, it’s hard to tell exactly what you're asking. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.
1
en.wikipedia.org/wiki/Marathi_language
– War
Nov 19 at 12:30
ASP.NET strings are Unicode strings.nvarchar
fields in SQL Server are Unicode fields as well. You don't have to convert anything to English. You definitely don't need to convert anything to UTF8 - that what ASP.NET itself does by default when it generates HTML.
– Panagiotis Kanavos
Nov 19 at 12:33
@SurentharP that linked question has nothing to do with this question. Besides, ASP.NET uses Unicode natively. So does SQL Server. No tricks or conversions are needed. StackOverflow is an ASP.NET site which is why I can write Αυτό Εδώ knowing it will be displayed without any problems
– Panagiotis Kanavos
Nov 19 at 12:34
1
@SurajS. as you see, I can pasteVarhadi (Varhādi) (वऱ्हाडी) or Vaidarbhi (वैदर्भी)
from the Wikipeda article and have it displayed in SO just fine. Are you asking how to transliterate the text using English characters ?
– Panagiotis Kanavos
Nov 19 at 12:39
Well, do you really need to translate the strings stored in that column in marathi lang to english lang or you have an issue to store marathi symbols to that column?
– Sami
Nov 19 at 12:41
|
show 2 more comments
1
en.wikipedia.org/wiki/Marathi_language
– War
Nov 19 at 12:30
ASP.NET strings are Unicode strings.nvarchar
fields in SQL Server are Unicode fields as well. You don't have to convert anything to English. You definitely don't need to convert anything to UTF8 - that what ASP.NET itself does by default when it generates HTML.
– Panagiotis Kanavos
Nov 19 at 12:33
@SurentharP that linked question has nothing to do with this question. Besides, ASP.NET uses Unicode natively. So does SQL Server. No tricks or conversions are needed. StackOverflow is an ASP.NET site which is why I can write Αυτό Εδώ knowing it will be displayed without any problems
– Panagiotis Kanavos
Nov 19 at 12:34
1
@SurajS. as you see, I can pasteVarhadi (Varhādi) (वऱ्हाडी) or Vaidarbhi (वैदर्भी)
from the Wikipeda article and have it displayed in SO just fine. Are you asking how to transliterate the text using English characters ?
– Panagiotis Kanavos
Nov 19 at 12:39
Well, do you really need to translate the strings stored in that column in marathi lang to english lang or you have an issue to store marathi symbols to that column?
– Sami
Nov 19 at 12:41
1
1
en.wikipedia.org/wiki/Marathi_language
– War
Nov 19 at 12:30
en.wikipedia.org/wiki/Marathi_language
– War
Nov 19 at 12:30
ASP.NET strings are Unicode strings.
nvarchar
fields in SQL Server are Unicode fields as well. You don't have to convert anything to English. You definitely don't need to convert anything to UTF8 - that what ASP.NET itself does by default when it generates HTML.– Panagiotis Kanavos
Nov 19 at 12:33
ASP.NET strings are Unicode strings.
nvarchar
fields in SQL Server are Unicode fields as well. You don't have to convert anything to English. You definitely don't need to convert anything to UTF8 - that what ASP.NET itself does by default when it generates HTML.– Panagiotis Kanavos
Nov 19 at 12:33
@SurentharP that linked question has nothing to do with this question. Besides, ASP.NET uses Unicode natively. So does SQL Server. No tricks or conversions are needed. StackOverflow is an ASP.NET site which is why I can write Αυτό Εδώ knowing it will be displayed without any problems
– Panagiotis Kanavos
Nov 19 at 12:34
@SurentharP that linked question has nothing to do with this question. Besides, ASP.NET uses Unicode natively. So does SQL Server. No tricks or conversions are needed. StackOverflow is an ASP.NET site which is why I can write Αυτό Εδώ knowing it will be displayed without any problems
– Panagiotis Kanavos
Nov 19 at 12:34
1
1
@SurajS. as you see, I can paste
Varhadi (Varhādi) (वऱ्हाडी) or Vaidarbhi (वैदर्भी)
from the Wikipeda article and have it displayed in SO just fine. Are you asking how to transliterate the text using English characters ?– Panagiotis Kanavos
Nov 19 at 12:39
@SurajS. as you see, I can paste
Varhadi (Varhādi) (वऱ्हाडी) or Vaidarbhi (वैदर्भी)
from the Wikipeda article and have it displayed in SO just fine. Are you asking how to transliterate the text using English characters ?– Panagiotis Kanavos
Nov 19 at 12:39
Well, do you really need to translate the strings stored in that column in marathi lang to english lang or you have an issue to store marathi symbols to that column?
– Sami
Nov 19 at 12:41
Well, do you really need to translate the strings stored in that column in marathi lang to english lang or you have an issue to store marathi symbols to that column?
– Sami
Nov 19 at 12:41
|
show 2 more comments
1 Answer
1
active
oldest
votes
up vote
2
down vote
accepted
If you want an actual translation (like to approximate the meaning in English), you'll either need a professional translator (recommended), or you could get an approximate translation by calling out to a Google Translate or Microsoft Translator API.
However, based on your use case of people's names, it sounds like what you want is to transliterate Marathi characters into an approximation of the sound in English letters, rather than a translation into English. If that's the case, you could probably write a function based loosely on the method Wikipedia uses.
The code that Wikipedia's mr-translit module implements is a series of rules for string replacement using successive Regex.Replace operations. Note that the code is in PHP, but it shouldn't be too hard to mimic their approach by translating the rules into C#. There are too many rules in their example for me to try to translate the whole of the code here, but the approach they've taken is to normalize a few combining diacritic patterns, then attempt to identify distinct words, and successively replace patterns until all that's left is romanized characters. Finally, the string is returned in Unicode Normalization Form C (presumably in case there are other non-Marathi characters in there). In .Net this is the method String.Normalize; in the PHP code from Wikipedia it looks like mw.ustring.toNFC(text)
looks complicated but i will try this. thanks @JasonTrue
– Suraj S.
Nov 19 at 13:21
add a comment |
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
2
down vote
accepted
If you want an actual translation (like to approximate the meaning in English), you'll either need a professional translator (recommended), or you could get an approximate translation by calling out to a Google Translate or Microsoft Translator API.
However, based on your use case of people's names, it sounds like what you want is to transliterate Marathi characters into an approximation of the sound in English letters, rather than a translation into English. If that's the case, you could probably write a function based loosely on the method Wikipedia uses.
The code that Wikipedia's mr-translit module implements is a series of rules for string replacement using successive Regex.Replace operations. Note that the code is in PHP, but it shouldn't be too hard to mimic their approach by translating the rules into C#. There are too many rules in their example for me to try to translate the whole of the code here, but the approach they've taken is to normalize a few combining diacritic patterns, then attempt to identify distinct words, and successively replace patterns until all that's left is romanized characters. Finally, the string is returned in Unicode Normalization Form C (presumably in case there are other non-Marathi characters in there). In .Net this is the method String.Normalize; in the PHP code from Wikipedia it looks like mw.ustring.toNFC(text)
looks complicated but i will try this. thanks @JasonTrue
– Suraj S.
Nov 19 at 13:21
add a comment |
up vote
2
down vote
accepted
If you want an actual translation (like to approximate the meaning in English), you'll either need a professional translator (recommended), or you could get an approximate translation by calling out to a Google Translate or Microsoft Translator API.
However, based on your use case of people's names, it sounds like what you want is to transliterate Marathi characters into an approximation of the sound in English letters, rather than a translation into English. If that's the case, you could probably write a function based loosely on the method Wikipedia uses.
The code that Wikipedia's mr-translit module implements is a series of rules for string replacement using successive Regex.Replace operations. Note that the code is in PHP, but it shouldn't be too hard to mimic their approach by translating the rules into C#. There are too many rules in their example for me to try to translate the whole of the code here, but the approach they've taken is to normalize a few combining diacritic patterns, then attempt to identify distinct words, and successively replace patterns until all that's left is romanized characters. Finally, the string is returned in Unicode Normalization Form C (presumably in case there are other non-Marathi characters in there). In .Net this is the method String.Normalize; in the PHP code from Wikipedia it looks like mw.ustring.toNFC(text)
looks complicated but i will try this. thanks @JasonTrue
– Suraj S.
Nov 19 at 13:21
add a comment |
up vote
2
down vote
accepted
up vote
2
down vote
accepted
If you want an actual translation (like to approximate the meaning in English), you'll either need a professional translator (recommended), or you could get an approximate translation by calling out to a Google Translate or Microsoft Translator API.
However, based on your use case of people's names, it sounds like what you want is to transliterate Marathi characters into an approximation of the sound in English letters, rather than a translation into English. If that's the case, you could probably write a function based loosely on the method Wikipedia uses.
The code that Wikipedia's mr-translit module implements is a series of rules for string replacement using successive Regex.Replace operations. Note that the code is in PHP, but it shouldn't be too hard to mimic their approach by translating the rules into C#. There are too many rules in their example for me to try to translate the whole of the code here, but the approach they've taken is to normalize a few combining diacritic patterns, then attempt to identify distinct words, and successively replace patterns until all that's left is romanized characters. Finally, the string is returned in Unicode Normalization Form C (presumably in case there are other non-Marathi characters in there). In .Net this is the method String.Normalize; in the PHP code from Wikipedia it looks like mw.ustring.toNFC(text)
If you want an actual translation (like to approximate the meaning in English), you'll either need a professional translator (recommended), or you could get an approximate translation by calling out to a Google Translate or Microsoft Translator API.
However, based on your use case of people's names, it sounds like what you want is to transliterate Marathi characters into an approximation of the sound in English letters, rather than a translation into English. If that's the case, you could probably write a function based loosely on the method Wikipedia uses.
The code that Wikipedia's mr-translit module implements is a series of rules for string replacement using successive Regex.Replace operations. Note that the code is in PHP, but it shouldn't be too hard to mimic their approach by translating the rules into C#. There are too many rules in their example for me to try to translate the whole of the code here, but the approach they've taken is to normalize a few combining diacritic patterns, then attempt to identify distinct words, and successively replace patterns until all that's left is romanized characters. Finally, the string is returned in Unicode Normalization Form C (presumably in case there are other non-Marathi characters in there). In .Net this is the method String.Normalize; in the PHP code from Wikipedia it looks like mw.ustring.toNFC(text)
answered Nov 19 at 12:53
JasonTrue
17.2k32855
17.2k32855
looks complicated but i will try this. thanks @JasonTrue
– Suraj S.
Nov 19 at 13:21
add a comment |
looks complicated but i will try this. thanks @JasonTrue
– Suraj S.
Nov 19 at 13:21
looks complicated but i will try this. thanks @JasonTrue
– Suraj S.
Nov 19 at 13:21
looks complicated but i will try this. thanks @JasonTrue
– Suraj S.
Nov 19 at 13:21
add a comment |
1
en.wikipedia.org/wiki/Marathi_language
– War
Nov 19 at 12:30
ASP.NET strings are Unicode strings.
nvarchar
fields in SQL Server are Unicode fields as well. You don't have to convert anything to English. You definitely don't need to convert anything to UTF8 - that what ASP.NET itself does by default when it generates HTML.– Panagiotis Kanavos
Nov 19 at 12:33
@SurentharP that linked question has nothing to do with this question. Besides, ASP.NET uses Unicode natively. So does SQL Server. No tricks or conversions are needed. StackOverflow is an ASP.NET site which is why I can write Αυτό Εδώ knowing it will be displayed without any problems
– Panagiotis Kanavos
Nov 19 at 12:34
1
@SurajS. as you see, I can paste
Varhadi (Varhādi) (वऱ्हाडी) or Vaidarbhi (वैदर्भी)
from the Wikipeda article and have it displayed in SO just fine. Are you asking how to transliterate the text using English characters ?– Panagiotis Kanavos
Nov 19 at 12:39
Well, do you really need to translate the strings stored in that column in marathi lang to english lang or you have an issue to store marathi symbols to that column?
– Sami
Nov 19 at 12:41