Chinese Full-Text Search in MongoDB Is No Longer a Hard Problem: A Few Simple Steps to Full-Text Index Queries

2023-09-23 09:21:48

MongoDB Chinese Full-Text Search: Lighting Up Data Retrieval

Putting a Chinese Word Segmenter to Work

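To implement Chinese full-text search in MongoDB, a Chinese word segmenter is essential. It splits Chinese text into individual words or phrases, giving MongoDB the terms it needs for indexing and search. This matters because MongoDB's standard text index splits terms on whitespace and punctuation, and Chinese is not among the languages it supports, so an unsegmented Chinese sentence ends up indexed as one long term that ordinary queries will not match. Fortunately, open-source segmenters such as jieba and SnowNLP are readily available. Both are Python libraries rather than MongoDB projects, and jieba also has Node.js ports such as nodejieba. Choose whichever segmenter best fits your needs.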
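
As a rough illustration of what segmentation produces, the sketch below uses JavaScript's built-in Intl.Segmenter as a stand-in for jieba. It is not jieba and may split some words differently; the helper name segmentChinese is hypothetical, and the code assumes a runtime with ICU data such as Node.js 16 or later.

```javascript
// Minimal sketch: word segmentation with the built-in Intl.Segmenter.
// This is a stand-in for jieba, not jieba itself.
function segmentChinese(text) {
  const segmenter = new Intl.Segmenter('zh', { granularity: 'word' });
  const tokens = [];
  for (const part of segmenter.segment(text)) {
    // isWordLike is false for punctuation and whitespace, so those are dropped.
    if (part.isWordLike) tokens.push(part.segment);
  }
  return tokens;
}

// Join with spaces so a whitespace-based indexer sees one term per word.
const spaced = segmentChinese('我爱北京天安门，你好').join(' ');
console.log(spaced);
```

The space-separated string is the form that a whitespace-based text index can work with.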

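Creating a MongoDB Text Index for Chinese

With a segmenter in place, the next step is to create the MongoDB text index. A text index indexes every term in a string field so that MongoDB can look terms up quickly. Because the index splits terms on whitespace and punctuation, the indexed field should hold the segmented, space-separated form of the Chinese text. Creating the index takes a single command in the MongoDB shell. Here fieldName is a placeholder for that field, and the optional default_language setting of "none" turns off the English stemming and stop-word rules, which do not apply to Chinese:

db.collection.createIndex({fieldName: "text"}, {default_language: "none"})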
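
Since the index only sees whitespace-separated terms, one common arrangement is to store the segmented text in its own field next to the original and index that field. The following is a minimal sketch under that assumption: the field names content and contentTokens and the helper toSearchableDoc are hypothetical, and the token array is assumed to come from whichever segmenter you chose.

```javascript
// Build a document that keeps the original text for display and a
// space-separated token string for the text index.
function toSearchableDoc(original, tokens) {
  return {
    content: original,               // shown to users
    contentTokens: tokens.join(' '), // indexed and searched
  };
}

// Hand-written tokens for illustration; a real segmenter would supply them.
const doc = toSearchableDoc('我爱北京', ['我', '爱', '北京']);
console.log(doc);

// Index specification to pass to createIndex for the segmented field.
const indexSpec = { contentTokens: 'text' };
console.log(indexSpec);
```

Keeping the original text separate means search results can show the unsegmented sentence rather than the spaced-out token string.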

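Querying with MongoDB Chinese Full-Text Search

Once everything is in place, you can run Chinese full-text queries against MongoDB. Here are some common examples.

Find all documents that contain “中国” (China):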

db.collection.find({"$text": {"$search": "中国"}})

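Find all documents that contain both “中国” and “北京” (Beijing). Space-separated terms are ORed, so to require both, wrap each term in escaped double quotes so that it is treated as a phrase that must match:

db.collection.find({"$text": {"$search": "\"中国\" \"北京\""}})

Find all documents that contain “中国” or “北京”. No OR keyword is needed (the word OR would itself be searched as a term); listing the terms separated by a space is enough:

db.collection.find({"$text": {"$search": "中国 北京"}})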
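
The OR and AND forms differ only in how the search string is assembled, so they are easy to generate from a list of terms. The helpers below are a hypothetical sketch (anyOf and allOf are not part of any MongoDB API). They only build the query documents and do not contact a database.

```javascript
// Space-separated terms: a document matches if it contains at least one.
function anyOf(terms) {
  return terms.join(' ');
}

// Each term wrapped in double quotes becomes a phrase; every phrase must match.
function allOf(terms) {
  return terms.map((t) => `"${t}"`).join(' ');
}

const orQuery = { $text: { $search: anyOf(['中国', '北京']) } };
const andQuery = { $text: { $search: allOf(['中国', '北京']) } };

console.log(JSON.stringify(orQuery));
console.log(JSON.stringify(andQuery));
```

Either object can be passed as the filter to find. Terms that themselves contain double quotes would need escaping first, which this sketch does not handle.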

Code Example

To make this more concrete, here is a Node.js example that uses MongoDB for Chinese full-text search:

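// Import the required libraries
const { MongoClient } = require('mongodb');
// Assumption: nodejieba, a Node.js port of jieba, stands in for the
// original require('jieba')
const nodejieba = require('nodejieba');

// Create the MongoDB client
const client = new MongoClient('mongodb://localhost:27017');

async function main() {
  try {
    // Connect to MongoDB
    await client.connect();

    // Get the collection; client.db() with no argument uses the default database
    const collection = client.db().collection('myCollection');

    // Segment the Chinese text into an array of words
    const words = nodejieba.cut('你好，世界');

    // Create the text index; fieldName is a placeholder for the field that
    // holds the segmented, space-separated text
    await collection.createIndex({ fieldName: 'text' }, { default_language: 'none' });

    // Run the full-text query; space-separated terms are ORed
    const result = await collection
      .find({ $text: { $search: words.join(' ') } })
      .toArray();

    // Print the search results
    console.log(result);
  } finally {
    // Close the MongoDB connection
    await client.close();
  }
}

main().catch(console.error);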