我要求按名称搜索人员.人们的姓名可以是英语,韩语或中文.为此,我使用Like条件在Name的基础上搜索如下:
select * from [MyTable] where Name like N'%t%'
上述声明给出了包含字母t的所有用户.但这不适用于韩语或中文.就像我用韩文字母search搜索一样,它应该给出包含这个字母的所有名字,如**정수연,재훈아이팟,정원혁테스트7 **.我尝试了以下方法,但它的结果为零
select * from [MyTable] where Name like N'%ㅈ%' - No Results
select PATINDEX(N'%ㅈ%',N'정수연(Mohan)') - giving value as ZERO
select Charindex(N'ㅈ',N'정수연') - giving value as ZERO
有没有办法在SQL服务器中找到其他语言字母的字母?
我知道如何使用编码技术在C#单词中找到其他语言的字母表存在但不在SQL服务器中.请帮助我这方面.
提前致谢.
编辑C#代码
public static string DecomposeSyllabels(string unicodeString) {
try {
//Consonant consonant only used
string[] JLT = { "ㄱ", "ㄲ", "ㄴ", "ㄷ", "ㄸ", "ㄹ", "ㅁ", "ㅂ", "ㅃ", "ㅅ", "ㅆ", "ㅇ", "ㅈ", "ㅉ", "ㅊ", "ㅋ", "ㅌ", "ㅍ", "ㅎ" };
// Only used a collection of neutral
string[] JVT = { "ㅏ", "ㅐ", "ㅑ", "ㅒ", "ㅓ", "ㅔ", "ㅕ", "ㅖ", "ㅗ", "ㅘ", "ㅙ", "ㅚ", "ㅛ", "ㅜ", "ㅝ", "ㅞ", "ㅟ", "ㅠ", "ㅡ", "ㅢ", "ㅣ" };
// Initial and coda consonants used in
string[] JTT = { "", "ㄱ", "ㄲ", "ㄳ", "ㄴ", "ㄵ", "ㄶ", "ㄷ", "ㄹ", "ㄺ", "ㄻ", "ㄼ", "ㄽ", "ㄾ", "ㄿ", "ㅀ", "ㅁ", "ㅂ", "ㅄ", "ㅅ", "ㅆ", "ㅇ", "ㅈ", "ㅊ", "ㅋ", "ㅌ", "ㅍ", "ㅎ" };
double SBase = 0xAC00;
long SCount = 11172;
int TCount = 28;
int NCount = 588;
string syllables = string.Empty;
foreach (char c in unicodeString) {
double SIndex = (int)c - SBase;
if (0 > SIndex || SIndex >= SCount) {
syllables = syllables + c;
continue;
}
int LIndex = (int)Math.Floor(SIndex / NCount);
int VIndex = (int)(Math.Floor((SIndex % NCount) / TCount));
int TIndex = (int)(SIndex % TCount);
syllables = syllables + (JLT[LIndex] + JVT[VIndex] + JTT[TIndex]);
}
return syllables;
}
catch {
return unicodeString;
}
}
最佳答案 您必须分解韩语音节并将它们存储在SQL数据库的单独列中(例如ㅈㅓㅇㅅㅕㄴㅕㄴfor정수연).我建议您编写一个小型自定义应用程序来解析您的数据库,分解所有韩语音节,并将结果保存到单独的列中.
编辑
这里有一些分解Hangul音节的Python代码:
#!/usr/local/bin/python
# -*- coding: utf8 -*-
import codecs, sys, os, math
JLT="ㄱ,ㄲ,ㄴ,ㄷ,ㄸ,ㄹ,ㅁ,ㅂ,ㅃ,ㅅ,ㅆ,ㅇ,ㅈ,ㅉ,ㅊ,ㅋ,ㅌ,ㅍ,ㅎ".split(",")
JTT=",ㄱ,ㄲ,ㄱㅅ,ㄴ,ㄴㅈ,ㄴㅎ,ㄷ,ㄹ,ㄹㄱ,ㄹㅁ,ㄹㅂ,ㄹㅅ,ㄹㅌ,ㄹㅍ,ㄹㅎ,ㅁ,ㅂ,ㅂㅅ,ㅅ,ㅆ,ㅇ,ㅈ,ㅊ,ㅋ,ㅌ,ㅍ,ㅎ".split(",")
JVT="ㅏ,ㅐ,ㅑ,ㅒ,ㅓ,ㅔ,ㅕ,ㅖ,ㅗ,ㅘ,ㅙ,ㅚ,ㅛ,ㅜ,ㅝ,ㅞ,ㅟ,ㅠ,ㅡ,ㅢ,ㅣ".split(",")
SBase=0xAC00
SCount=11172
TCount=28
NCount=588
def HangulName(a):
b=a.decode('utf8')
sound=''
for i in b:
cp=ord(i)
SIndex = cp - SBase
if (0 > SIndex or SIndex >= SCount):
# "Not a Hangul Syllable"
pass
LIndex = int(math.floor(SIndex / NCount))
VIndex = int(math.floor((SIndex % NCount) / TCount))
TIndex = int(SIndex % TCount)
sound=sound+(JLT[LIndex] + JVT[VIndex] + JTT[TIndex]).lower()
return sound
print HangulName("정수연")
dda$python test.py
ㅈㅓㅇㅅㅜㅇㅕㄴ