如何从给定的ISO语言代码检索匹配的宏语言区域设置?

4
给定一个属于个别范畴的ISO 639-2/T语言代码,如何在程序中找到相应的宏语言代码,如果存在匹配的话?
例如,如何从“nob”(挪威博克马尔语,个体范畴)转换为“nor”(挪威语,宏语言范畴)?
通常情况下,在同一国家内可能有多种不属于同一宏语言的个体语言,仅仅根据国家分组会得出错误的结果。
java.util.locale 知道 ISO 639 三位字母的语言代码,并识别上述示例中的两个代码,但它没有涵盖“scope nor macrolanguage”的概念。
在我的案例中,即使是一个没有误报的启发式算法也很有用。
1个回答

2
您可以制作您自己的宏语言列表,以及相应的单独语言。
以下是列表: https://iso639-3.sil.org/code_tables/639/data/all?title=&field_iso639_cd_st_mmbrshp_639_1_tid=All&name_3=&field_iso639_element_scope_tid=76&field_iso639_language_type_tid=All&items_per_page=200 这是我之前挑选的一些语言:
public static final Map<String, String> macroLanguages = new HashMap<>();
static {
    macroLanguages.put("aao", "ara"); //https://iso639-3.sil.org/code/ara
    macroLanguages.put("abh", "ara");
    macroLanguages.put("abv", "ara");
    macroLanguages.put("acm", "ara");
    macroLanguages.put("acq", "ara");
    macroLanguages.put("acw", "ara");
    macroLanguages.put("acx", "ara");
    macroLanguages.put("acy", "ara");
    macroLanguages.put("adf", "ara");
    macroLanguages.put("aeb", "ara");
    macroLanguages.put("aec", "ara");
    macroLanguages.put("afb", "ara");
    macroLanguages.put("ajp", "ara");
    macroLanguages.put("apc", "ara");
    macroLanguages.put("apd", "ara");
    macroLanguages.put("arb", "ara");
    macroLanguages.put("arq", "ara");
    macroLanguages.put("ars", "ara");
    macroLanguages.put("ary", "ara");
    macroLanguages.put("arz", "ara");
    macroLanguages.put("auz", "ara");
    macroLanguages.put("avl", "ara");
    macroLanguages.put("ayh", "ara");
    macroLanguages.put("ayl", "ara");
    macroLanguages.put("ayn", "ara");
    macroLanguages.put("ayp", "ara");
    macroLanguages.put("bbz", "ara");
    macroLanguages.put("pga", "ara");
    macroLanguages.put("shu", "ara");
    macroLanguages.put("ssh", "ara");

    macroLanguages.put("ekk", "est"); //https://iso639-3.sil.org/code/est
    macroLanguages.put("vro", "est");

    macroLanguages.put("bos", "hbs"); //https://iso639-3.sil.org/code/hbs
    macroLanguages.put("hrv", "hbs");
    macroLanguages.put("srp", "hbs");
    macroLanguages.put("cnr", "hbs");

    macroLanguages.put("ltg", "lav"); //https://iso639-3.sil.org/code/lav
    macroLanguages.put("lvs", "lav");

    macroLanguages.put("nno", "nor"); //https://iso639-3.sil.org/code/nor
    macroLanguages.put("nob", "nor");

    macroLanguages.put("aae", "sqi"); //https://iso639-3.sil.org/code/sqi
    macroLanguages.put("aat", "sqi");
    macroLanguages.put("aln", "sqi");
    macroLanguages.put("als", "sqi");

    macroLanguages.put("ydd", "yid"); //https://iso639-3.sil.org/code/yid
    macroLanguages.put("yih", "yid");

    macroLanguages.put("ccx", "zha"); //https://iso639-3.sil.org/code/zha
    macroLanguages.put("ccy", "zha");
    macroLanguages.put("zch", "zha");
    macroLanguages.put("zeh", "zha");
    macroLanguages.put("zgb", "zha");
    macroLanguages.put("zgm", "zha");
    macroLanguages.put("zgn", "zha");
    macroLanguages.put("zhd", "zha");
    macroLanguages.put("zhn", "zha");
    macroLanguages.put("zlj", "zha");
    macroLanguages.put("zln", "zha");
    macroLanguages.put("zlq", "zha");
    macroLanguages.put("zqe", "zha");
    macroLanguages.put("zyb", "zha");
    macroLanguages.put("zyg", "zha");
    macroLanguages.put("zyj", "zha");
    macroLanguages.put("zyn", "zha");
    macroLanguages.put("zzj", "zha");

    macroLanguages.put("cdo", "zho"); //https://iso639-3.sil.org/code/zho
    macroLanguages.put("cjy", "zho");
    macroLanguages.put("cmn", "zho");
    macroLanguages.put("cpx", "zho");
    macroLanguages.put("czh", "zho");
    macroLanguages.put("czo", "zho");
    macroLanguages.put("gan", "zho");
    macroLanguages.put("hak", "zho");
    macroLanguages.put("hsn", "zho");
    macroLanguages.put("lzh", "zho");
    macroLanguages.put("mnp", "zho");
    macroLanguages.put("nan", "zho");
    macroLanguages.put("wuu", "zho");
    macroLanguages.put("yue", "zho");
    macroLanguages.put("cnp", "zho");
    macroLanguages.put("csp", "zho");

    macroLanguages.put("pes", "fas"); //https://iso639-3.sil.org/code/fas
    macroLanguages.put("prs", "fas");
}

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接