
aggregate (database command)

Definition

aggregate

Performs an aggregation operation using the aggregation pipeline. The pipeline allows users to process data from a collection or other source with a sequence of stage-based manipulations.

Tip

In mongosh, this command can also be run through the db.aggregate() and db.collection.aggregate() helper methods, or with the watch() helper method.

Helper methods are convenient for mongosh users, but they may not return the same level of information as database commands. In cases where the convenience is not needed or the additional return fields are required, use the database command.
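
For example, the following sketch (using a hypothetical articles collection) shows the helper call and its database-command equivalent; the helper returns a cursor over the results, while the command form returns the raw reply document, including the cursor subdocument and server-level fields such as ok:

// Helper method: returns a cursor over the results.
db.articles.aggregate( [ { $match: { author: "abc123" } } ] )

// Equivalent database command: returns the raw command reply.
db.runCommand( {
   aggregate: "articles",
   pipeline: [ { $match: { author: "abc123" } } ],
   cursor: { }
} )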

Compatibility

This command is available in deployments hosted in the following environments:

  • MongoDB Atlas: The fully managed service for MongoDB deployments in the cloud

Important

This command has limited support in M0 and Flex clusters. For more information, see Unsupported Commands.

  • MongoDB Enterprise: The subscription-based, self-managed version of MongoDB
  • MongoDB Community: The source-available, free-to-use, and self-managed version of MongoDB

Syntax

Changed in version 5.0.

The command has the following syntax:

db.runCommand(
   {
      aggregate: "<collection>" || 1,
      pipeline: [ <stage>, <...> ],
      explain: <boolean>,
      allowDiskUse: <boolean>,
      cursor: <document>,
      maxTimeMS: <int>,
      bypassDocumentValidation: <boolean>,
      readConcern: <document>,
      collation: <document>,
      hint: <string or document>,
      comment: <any>,
      writeConcern: <document>,
      let: <document> // Added in MongoDB 5.0
   }
)

Command Fields

The aggregate command takes the following fields as arguments:

aggregate (string)
The name of the collection or view that acts as the input for the aggregation pipeline. Use 1 for collection-agnostic commands.

pipeline (array)
An array of aggregation pipeline stages that process and transform the document stream as part of the aggregation pipeline.

explain (boolean)
Optional. Specifies to return the information on the processing of the pipeline.

Not available in multi-document transactions.

allowDiskUse (boolean)
Optional.
Use this option to override allowDiskUseByDefault for a specific query. You can use this option to either:
  • Prohibit disk use on a system where disk use is allowed by default.
  • Allow disk use on a system where disk use is prohibited by default.
Starting in MongoDB 6.0, if allowDiskUseByDefault is set to true and the server requires more than 100 megabytes of memory for a pipeline execution stage, MongoDB automatically writes temporary files to disk unless the query specifies { allowDiskUse: false }.
For details, see allowDiskUseByDefault.
The profiler log messages and diagnostic log messages include a usedDisk indicator if any aggregation stage wrote data to temporary files due to memory restrictions.

cursor (document)
Specify a document that contains options that control the creation of the cursor object.
You must use the aggregate command with the cursor option unless the command includes the explain option.
  • To indicate a cursor with the default batch size, specify cursor: {}.
  • To indicate a cursor with a non-default batch size, use cursor: { batchSize: <num> }.

maxTimeMS (non-negative integer)
Optional.
Specifies a time limit in milliseconds. If you do not specify a value for maxTimeMS, operations will not time out. A value of 0 explicitly specifies the default unbounded behavior.
MongoDB terminates operations that exceed their allotted time limit using the same mechanism as db.killOp(). MongoDB only terminates an operation at one of its designated interrupt points.

bypassDocumentValidation (boolean)
Optional. Applicable only if you specify the $out or $merge aggregation stages.
Enables aggregate to bypass schema validation during the operation. This lets you insert documents that do not meet the validation requirements.

readConcern (document)
Optional. Specifies the read concern.
The readConcern option has the following syntax: readConcern: { level: <value> }
Possible read concern levels are:
  • "local". This is the default read concern level for read operations against the primary and secondaries.
  • "available". Available for read operations against the primary and secondaries. "available" behaves the same as "local" against the primary and non-sharded secondaries. The query returns the instance's most recent data.
  • "majority". Available for replica sets that use the WiredTiger storage engine.
  • "linearizable". Available for read operations on the primary only.
  • "snapshot". Available for multi-document transactions and certain read operations outside of multi-document transactions.
For more information on the read concern levels, see Read Concern Levels.
The $out stage cannot be used in conjunction with read concern "linearizable". If you specify "linearizable" read concern for db.collection.aggregate(), you cannot include the $out stage in the pipeline.

The $merge stage cannot be used in conjunction with read concern "linearizable". That is, if you specify "linearizable" read concern for db.collection.aggregate(), you cannot include the $merge stage in the pipeline.

collation (document)
Optional.

Specifies the collation to use for the operation.

Collation allows users to specify language-specific rules for string comparison, such as rules for lettercase and accent marks.

The collation option has the following syntax:

collation: {
   locale: <string>,
   caseLevel: <boolean>,
   caseFirst: <string>,
   strength: <int>,
   numericOrdering: <boolean>,
   alternate: <string>,
   maxVariable: <string>,
   backwards: <boolean>
}

When specifying collation, the locale field is mandatory; all other collation fields are optional. For descriptions of the fields, see Collation Document.

If the collation is unspecified but the collection has a default collation (see db.createCollection()), the operation uses the collation specified for the collection.

If no collation is specified for the collection or for the operations, MongoDB uses the simple binary comparison used in prior versions for string comparisons.

You cannot specify multiple collations for an operation. For example, you cannot specify different collations per field, or if performing a find with a sort, you cannot use one collation for the find and another for the sort.

hint (string or document)
Optional. The index to use for the aggregation. The index is on the initial collection/view against which the aggregation is run.

Specify the index either by the index name or by the index specification document.

The hint does not apply to $lookup and $graphLookup stages.

comment (any)
Optional. A user-provided comment to attach to this command. Once set, this comment appears alongside records of this command in the database profiler output, in mongod log messages, and in currentOp output.

A comment can be any valid BSON type (string, integer, object, array, etc.).

Any comment set on an aggregate command is inherited by any subsequent getMore commands run on the aggregate cursor.

writeConcern (document)
Optional. A document that expresses the write concern to use with the $out or $merge stage.

Omit to use the default write concern with the $out or $merge stage.

let (document)

Optional.

Specifies a document with a list of variables. This allows you to improve command readability by separating the variables from the query text.

The document syntax is:

{
   <variable_name_1>: <expression_1>,
   ...,
   <variable_name_n>: <expression_n>
}

The variable is set to the value returned by the expression, and cannot be changed afterwards.

To access the value of a variable in the command, use the double dollar sign prefix ($$) together with your variable name in the form $$<variable_name>. For example: $$targetTotal.

To use a variable to filter results in a pipeline $match stage, you must access the variable within the $expr operator.

For a complete example using let and variables, see Use Variables in let.

New in version 5.0.

You must use the aggregate command with the cursor option unless the command includes the explain option.

  • To indicate a cursor with the default batch size, specify cursor: {}.
  • To indicate a cursor with a non-default batch size, use cursor: { batchSize: <num> }.
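
As an illustration, the following sketch (against a hypothetical orders collection) combines the cursor, maxTimeMS, and comment fields in one command:

db.runCommand( {
   aggregate: "orders",
   pipeline: [ { $match: { status: "A" } } ],
   cursor: { batchSize: 100 },   // limit the first batch to 100 documents
   maxTimeMS: 2000,              // abort the operation if it runs longer than 2 seconds
   comment: "nightly status report"
} )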

For more information about the aggregation pipeline, see Aggregation Pipeline.

Sessions

For cursors created inside a session, you cannot call getMore outside the session.

Similarly, for cursors created outside of a session, you cannot call getMore inside a session.

Session Idle Timeout

MongoDB drivers and mongosh associate all operations with a server session, with the exception of unacknowledged write operations. For operations not explicitly associated with a session (i.e. not run with an explicit session from Mongo.startSession()), MongoDB drivers and mongosh create an implicit session and associate it with the operation.

If a session is idle for longer than 30 minutes, the MongoDB server marks that session as expired and may close it at any time. When the MongoDB server closes the session, it also kills any in-progress operations and open cursors associated with the session. This includes cursors configured with noCursorTimeout() or a maxTimeMS() greater than 30 minutes.

For operations that return a cursor, if the cursor may be idle for longer than 30 minutes, issue the operation within an explicit session using Mongo.startSession() and periodically refresh the session using the refreshSessions command. See Session Idle Timeout for more information.
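
A minimal sketch of that pattern in mongosh, assuming a hypothetical test.orders namespace (adjust the refresh interval to your workload):

// Start an explicit session and run the aggregation inside it.
var session = db.getMongo().startSession()
var sessionDb = session.getDatabase("test")
var cursor = sessionDb.orders.aggregate( [ { $match: { status: "A" } } ] )

// While the cursor remains open, periodically refresh the session so the
// server does not expire it after 30 idle minutes.
db.adminCommand( { refreshSessions: [ session.getSessionId() ] } )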

Transactions

aggregate can be used inside distributed transactions.

However, certain stages, such as $out, $merge, and $currentOp, are not allowed within transactions.

You also cannot specify the explain option.

  • For cursors created outside of a transaction, you cannot call getMore inside the transaction.
  • For cursors created in a transaction, you cannot call getMore outside the transaction.
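
A minimal sketch of running aggregate inside a transaction from mongosh, assuming a hypothetical test.orders namespace:

var session = db.getMongo().startSession()
var coll = session.getDatabase("test").orders

session.startTransaction( { readConcern: { level: "snapshot" }, writeConcern: { w: "majority" } } )
try {
   // Stages such as $out and $merge, and the explain option, are not allowed here.
   var results = coll.aggregate( [ { $match: { status: "A" } } ] ).toArray()
   session.commitTransaction()
} catch (error) {
   session.abortTransaction()
   throw error
} finally {
   session.endSession()
}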

Important

In most cases, a distributed transaction incurs a greater performance cost over single document writes, and the availability of distributed transactions should not be a replacement for effective schema design. For many scenarios, the denormalized data model (embedded documents and arrays) will continue to be optimal for your data and use cases. That is, for many scenarios, modeling your data appropriately will minimize the need for distributed transactions.

For additional transactions usage considerations (such as runtime limit and oplog size limit), see also Production Considerations.

Client Disconnection

For aggregate operations that do not include the $out or $merge stages:

If the client that issued aggregate disconnects before the operation completes, MongoDB marks aggregate for termination using killOp.

Query Settings

New in version 8.0.

You can use query settings to set index hints, set operation rejection filters, and other fields. The settings apply to the query shape on the entire cluster. The cluster retains the settings after shutdown.

The query optimizer uses the query settings as an additional input during query planning, which affects the plan selected to run the query. You can also use query settings to block a query shape.

To add query settings and explore examples, see setQuerySettings.

You can add query settings for find, distinct, and aggregate commands.

Query settings have more functionality and are preferred over deprecated index filters.

To remove query settings, use removeQuerySettings. To obtain the query settings, use a $querySettings stage in an aggregation pipeline.
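
For example, a minimal sketch of listing the settings currently stored on the cluster with a collectionless aggregation (this assumes MongoDB 8.0 or later):

db.aggregate( [
   { $querySettings: {} }
] )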

Stable API

When using Stable API V1, the aggregate command has additional restrictions on the pipeline stages and options you can use. See the Stable API documentation for details.

Examples

You must use the aggregate command with the cursor option unless the command includes the explain option.

  • To indicate a cursor with the default batch size, specify cursor: {}.
  • To indicate a cursor with a non-default batch size, use cursor: { batchSize: <num> }.

Rather than run the aggregate command directly, most users should use the db.collection.aggregate() helper provided in mongosh or the equivalent helper in their driver. In MongoDB 2.6 and later, the db.collection.aggregate() helper always returns a cursor.

Except for the first two examples which demonstrate the command syntax, the examples in this page use the db.collection.aggregate() helper.

Aggregate Data with Multi-Stage Pipeline

A collection articles contains documents such as the following:

{
   _id: ObjectId("52769ea0f3dc6ead47c9a1b2"),
   author: "abc123",
   title: "zzz",
   tags: [ "programming", "database", "mongodb" ]
}

The following example performs an aggregate operation on the articles collection to calculate the count of each distinct element in the tags array that appears in the collection.

db.runCommand( {
   aggregate: "articles",
   pipeline: [
      { $project: { tags: 1 } },
      { $unwind: "$tags" },
      { $group: { _id: "$tags", count: { $sum: 1 } } }
   ],
   cursor: { }
} )

In mongosh, this operation can use the db.collection.aggregate() helper as in the following:

db.articles.aggregate( [
   { $project: { tags: 1 } },
   { $unwind: "$tags" },
   { $group: { _id: "$tags", count: { $sum: 1 } } }
] )

Use $currentOp on an Admin Database

The following example runs a pipeline with two stages on the admin database. The first stage runs the $currentOp operation and the second stage filters the results of that operation.

db.adminCommand( {
   aggregate: 1,
   pipeline: [
      { $currentOp: { allUsers: true, idleConnections: true } },
      { $match: { shard: "shard01" } }
   ],
   cursor: { }
} )

Note

The aggregate command does not specify a collection and instead takes the form {aggregate: 1}. This is because the initial $currentOp stage does not draw input from a collection. It produces its own data that the rest of the pipeline uses.

The db.aggregate() helper has been added to assist in running collectionless aggregations such as this one.
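
For instance, the same pipeline could be run with that helper; db.getSiblingDB("admin") is used in this sketch so the stages run against the admin database:

db.getSiblingDB("admin").aggregate( [
   { $currentOp: { allUsers: true, idleConnections: true } },
   { $match: { shard: "shard01" } }
] )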

Return Information on the Aggregation Operation

The following aggregation operation sets the optional field explain to true to return information about the aggregation operation.

db.orders.aggregate(
   [
      { $match: { status: "A" } },
      { $group: { _id: "$cust_id", total: { $sum: "$amount" } } },
      { $sort: { total: -1 } }
   ],
   { explain: true }
)
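
Expressed as a database command, the same request looks like the following sketch; because explain is set, the cursor field can be omitted:

db.runCommand( {
   aggregate: "orders",
   pipeline: [
      { $match: { status: "A" } },
      { $group: { _id: "$cust_id", total: { $sum: "$amount" } } },
      { $sort: { total: -1 } }
   ],
   explain: true
} )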

Note

The explain output is subject to change between releases.

Tip

db.collection.aggregate() method

Interaction with allowDiskUseByDefault

Starting in MongoDB 6.0, pipeline stages that require more than 100 megabytes of memory to execute write temporary files to disk by default. These temporary files last for the duration of the pipeline execution and can influence storage space on your instance. In earlier versions of MongoDB, you must pass { allowDiskUse: true } to individual find and aggregate commands to enable this behavior.

Individual find and aggregate commands can override the allowDiskUseByDefault parameter by either:

  • Using { allowDiskUse: true } to allow writing temporary files out to disk when allowDiskUseByDefault is set to false
  • Using { allowDiskUse: false } to prohibit writing temporary files out to disk when allowDiskUseByDefault is set to true
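
For example, the following sketch (reusing the orders collection from the examples above) prohibits disk use for a single aggregation regardless of the server-wide default:

db.orders.aggregate(
   [
      { $group: { _id: "$cust_id", total: { $sum: "$amount" } } },
      { $sort: { total: -1 } }
   ],
   { allowDiskUse: false }   // fail rather than spill to disk if a stage exceeds the memory limit
)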

The profiler log messages and diagnostic log messages include a usedDisk indicator if any aggregation stage wrote data to temporary files due to memory restrictions.

Aggregate Data Specifying Batch Size

To specify an initial batch size, specify the batchSize in the cursor field, as in the following example:

db.orders.aggregate(
   [
      { $match: { status: "A" } },
      { $group: { _id: "$cust_id", total: { $sum: "$amount" } } },
      { $sort: { total: -1 } },
      { $limit: 2 }
   ],
   { cursor: { batchSize: 0 } }
)

The { cursor: { batchSize: 0 } } document, which specifies the size of the initial batch size, indicates an empty first batch. This batch size is useful for quickly returning a cursor or failure message without doing significant server-side work.

To specify batch size for subsequent getMore operations (after the initial batch), use the batchSize field when running the getMore command.
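
A sketch of such a getMore call follows; the cursor id shown is a placeholder for the id returned in the aggregate reply:

db.runCommand( {
   getMore: NumberLong("1234567890123456"),   // placeholder: cursor id from the aggregate reply
   collection: "orders",
   batchSize: 100
} )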

Specify a Collation

Collation allows users to specify language-specific rules for string comparison, such as rules for lettercase and accent marks.

A collection myColl has the following documents:

{ _id: 1, category: "café", status: "A" }
{ _id: 2, category: "cafe", status: "a" }
{ _id: 3, category: "cafE", status: "a" }

The following aggregation operation includes the Collation option:

db.myColl.aggregate(
   [ { $match: { status: "A" } }, { $group: { _id: "$category", count: { $sum: 1 } } } ],
   { collation: { locale: "fr", strength: 1 } }
);

For descriptions on the collation fields, see Collation Document.

Hint an Index

Create a collection foodColl with the following documents:

db.foodColl.insertMany( [
   { _id: 1, category: "cake", type: "chocolate", qty: 10 },
   { _id: 2, category: "cake", type: "ice cream", qty: 25 },
   { _id: 3, category: "pie", type: "boston cream", qty: 20 },
   { _id: 4, category: "pie", type: "blueberry", qty: 15 }
] )

Create the following indexes:

db.foodColl.createIndex( { qty: 1, type: 1 } );
db.foodColl.createIndex( { qty: 1, category: 1 } );

The following aggregation operation includes the hint option to force the usage of the specified index:

db.foodColl.aggregate(
   [ { $sort: { qty: 1 } }, { $match: { category: "cake", qty: 10 } }, { $sort: { type: -1 } } ],
   { hint: { qty: 1, category: 1 } }
)

Override Default Read Concern

To override the default read concern level, use the readConcern option. The getMore command uses the readConcern level specified in the originating aggregate command.

You cannot use the $out or the $merge stage in conjunction with read concern "linearizable". That is, if you specify "linearizable" read concern for db.collection.aggregate(), you cannot include either stage in the pipeline.

The following operation on a replica set specifies a read concern of "majority" to read the most recent copy of the data confirmed as having been written to a majority of the nodes.

Important

  • You can specify read concern level "majority" for an aggregation that includes an $out stage.
  • Regardless of the read concern level, the most recent data on a node may not reflect the most recent version of the data in the system.

db.restaurants.aggregate(
   [ { $match: { rating: { $lt: 5 } } } ],
   { readConcern: { level: "majority" } }
)

To ensure that a single thread can read its own writes, use "majority" read concern and "majority" write concern against the primary of the replica set.
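
As a sketch of combining the two, the following aggregation reads with "majority" read concern and writes its $merge output with "majority" write concern (the topRestaurants output collection is hypothetical):

db.restaurants.aggregate(
   [
      { $match: { rating: { $gte: 4 } } },
      { $merge: { into: "topRestaurants" } }   // hypothetical output collection
   ],
   {
      readConcern: { level: "majority" },
      writeConcern: { w: "majority" }
   }
)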

Use Variables in let

New in version 5.0.

To define variables that you can access elsewhere in the command, use the let option.

Note

To filter results using a variable in a pipeline $match stage, you must access the variable within the $expr operator.

Create a collection cakeSales containing sales for cake flavors:

db.cakeSales.insertMany( [
   { _id: 1, flavor: "chocolate", salesTotal: 1580 },
   { _id: 2, flavor: "strawberry", salesTotal: 4350 },
   { _id: 3, flavor: "cherry", salesTotal: 2150 }
] )

The following example:

  • retrieves the cake that has a salesTotal greater than 3000, which is the cake with an _id of 2
  • defines a targetTotal variable in let, which is referenced in $gt as $$targetTotal
db.runCommand( {
   aggregate: db.cakeSales.getName(),
   pipeline: [
      { $match: {
         $expr: { $gt: [ "$salesTotal", "$$targetTotal" ] }
      } }
   ],
   cursor: {},
   let: { targetTotal: 3000 }
} )
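
In mongosh, the same operation can also be run with the db.collection.aggregate() helper by passing let in the options document (a minimal sketch):

db.cakeSales.aggregate(
   [
      { $match: { $expr: { $gt: [ "$salesTotal", "$$targetTotal" ] } } }
   ],
   { let: { targetTotal: 3000 } }
)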