How Do You Use PHP's mb_get_info Function? A Detailed Guide to Using It When Inspecting and Handling File Encodings

gitbox 2025-05-11

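In PHP projects that process multilingual text or need consistent character encoding, the mbstring extension plays a key role. Its mb_get_info() function is a practical tool: it shows the current mbstring configuration at a glance, which is a useful reference when tracking down character encoding problems.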

1. What is mb_get_info()?

mb_get_info() is a function provided by PHP's mbstring extension. It returns the configuration of the current multibyte string environment.

Syntax:

mb_get_info(string $type = "all"): array|string|int|false

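$type: optional. Names the setting to retrieve, for example "internal_encoding", "http_input", "http_output", "language", "detect_order", or "substitute_character". If omitted, it defaults to "all" and every setting is returned as an associative array.
Return value: an associative array when $type is omitted or "all"; otherwise the value of the requested setting. Most settings are strings, but some are not: "detect_order" is an array of encoding names and "illegal_chars" is an integer. false indicates failure, such as an unrecognized $type.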
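
As a minimal sketch of the $type parameter, the snippet below queries a few individual settings rather than the whole array. It assumes the mbstring extension is loaded; the variable names are illustrative only.

```php
<?php
// Sketch: query individual mbstring settings instead of the full array.
// Assumes the mbstring extension is loaded.

// A single setting usually comes back as a scalar...
$internal = mb_get_info('internal_encoding');   // e.g. "UTF-8"
$language = mb_get_info('language');            // e.g. "neutral"

// ...while "detect_order" comes back as a list of encoding names.
$detectOrder = mb_get_info('detect_order');

echo "internal_encoding: $internal\n";
echo "language: $language\n";
echo "detect_order: " . implode(', ', $detectOrder) . "\n";
```

Asking for one setting at a time keeps checks short, for example when a bootstrap script only needs to confirm the internal encoding.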

Example call:

print_r(mb_get_info());

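The output looks similar to the following (abbreviated; the exact keys and values depend on the PHP version and configuration):

Array
(
    [internal_encoding] => UTF-8
    [http_input] => pass
    [http_output] => pass
    [language] => neutral
    [encoding_translation] => Off
)

This shows that mbstring's internal encoding for the current script is UTF-8.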
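
The values reported are the settings in effect for the current script, so changes made at runtime are visible as well. The following sketch switches the internal encoding temporarily with mb_internal_encoding() and reads it back through mb_get_info(). The choice of ISO-8859-1 is arbitrary and only for demonstration.

```php
<?php
// Sketch: runtime changes to mbstring settings show up in mb_get_info().
$original = mb_internal_encoding();          // remember the current setting

mb_internal_encoding('ISO-8859-1');          // switch for demonstration only
$during = mb_get_info('internal_encoding');

mb_internal_encoding($original);             // restore the previous setting
$after = mb_get_info('internal_encoding');

echo "during: $during\n";
echo "after:  $after\n";
```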

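2. Using mb_get_info when checking file encodings

mb_get_info() does not read or detect a file's encoding itself, but it can be combined with other functions, such as mb_detect_encoding(), to identify and convert the encoding of text files.

Step 1: Read the file contents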

$content = file_get_contents('sample.txt');
if ($content === false) { throw new RuntimeException('Unable to read sample.txt'); }

Step 2: Detect the encoding

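// List the encodings you actually expect (this list is only an example); strict mode rejects invalid byte sequences.
$encoding = mb_detect_encoding($content, ['UTF-8', 'GBK', 'BIG-5'], true);
echo "Detected encoding: " . ($encoding === false ? 'unknown' : $encoding) . "\n";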
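
Detection is a heuristic, and the same bytes can be valid in more than one encoding, so it helps to handle a false result and to validate explicitly where possible. The sketch below works on in-memory byte strings instead of a file. The sample strings and the candidate list are assumptions chosen for illustration.

```php
<?php
// Sketch: treat detection as a guess and validate explicitly.
$samples = [
    'utf8' => "caf\u{00E9}",        // "café" encoded as UTF-8
    'gbk'  => "\xD6\xD0\xCE\xC4",   // two Chinese characters encoded as GBK
];

// Example candidate list (an assumption; adjust to the encodings you expect).
$candidates = ['UTF-8', 'GBK'];

$results = [];
foreach ($samples as $label => $bytes) {
    $results[$label] = [
        // May be false, and may be ambiguous when several candidates fit.
        'detected'  => mb_detect_encoding($bytes, $candidates, true),
        // A definite answer to a narrower question: is this valid UTF-8?
        'validUtf8' => mb_check_encoding($bytes, 'UTF-8'),
    ];
    printf(
        "%s: detected=%s, valid UTF-8=%s\n",
        $label,
        $results[$label]['detected'] === false ? 'unknown' : $results[$label]['detected'],
        $results[$label]['validUtf8'] ? 'yes' : 'no'
    );
}
```

mb_check_encoding() only confirms or rejects one specific encoding, which makes it a useful cross-check before trusting a detected name.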

Step 3: Convert to a common encoding (for example UTF-8) when necessary

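// Skip conversion if detection failed or the content is already UTF-8.
if ($encoding !== false && $encoding !== 'UTF-8') {
    $content = mb_convert_encoding($content, 'UTF-8', $encoding);
    file_put_contents('sample_utf8.txt', $content);
    echo "Converted to UTF-8 and saved to a new file.\n";
}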
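
The three steps above can be tied back to mb_get_info() by using the reported internal encoding as the conversion target. The following is a sketch under stated assumptions, not a definitive implementation: to_internal_encoding() and its $candidates parameter are hypothetical names introduced here, and the input is an in-memory byte string so the example runs without a file.

```php
<?php
// Sketch: hypothetical helper that converts text to mbstring's internal encoding.

/**
 * Convert $bytes to the internal encoding reported by mb_get_info().
 * to_internal_encoding() and $candidates are illustrative, not part of mbstring.
 */
function to_internal_encoding(string $bytes, array $candidates): string
{
    $target = mb_get_info('internal_encoding');

    $source = mb_detect_encoding($bytes, $candidates, true);
    if ($source === false) {
        throw new RuntimeException('Could not detect the encoding from the candidate list.');
    }

    // Nothing to do when the data is already in the target encoding.
    if (strcasecmp($source, $target) === 0) {
        return $bytes;
    }

    return mb_convert_encoding($bytes, $target, $source);
}

mb_internal_encoding('UTF-8');                 // make the target explicit for this demo
$gbkBytes  = "\xD6\xD0\xCE\xC4";               // two Chinese characters encoded as GBK
$converted = to_internal_encoding($gbkBytes, ['UTF-8', 'GBK']);

echo $converted, "\n";
echo mb_check_encoding($converted, 'UTF-8') ? "valid UTF-8\n" : "not valid UTF-8\n";
```

Reading the target from mb_get_info() keeps the conversion in line with whatever the rest of the application uses for its multibyte string functions, instead of hard-coding 'UTF-8' in several places.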