$linearFill (aggregation)
On this page本页内容
Definition定义
$linearFill
New in version 5.3.5.3版新增。Fills使用基于周围字段值的线性插值null
and missing fields in a window using linear interpolationbased on surrounding field values.
填充窗口中的空字段和缺失字段。
$linearFill
is only available in the仅在$setWindowFields
stage.$setWindowFields
阶段中可用。
Syntax语法
The $linearFill
expression has this syntax:$linearFill
表达式具有以下语法:
{ $linearFill: <expression> }
For more information on expressions, see Expressions.有关表达式的详细信息,请参阅表达式。
Behavior行为
$linearFill
fills null
and missing fields using linear interpolation based on surrounding non-
null
field values. $linearFill
使用基于周围非null
字段值的线性插值来填充
null
和缺失字段。The surrounding field values are determined by the sort order specified in 周围的字段值由$setWindowFields
.$setWindowFields
中指定的排序顺序决定。
$linearFill
fills按比例填充null
and missing values proportionally spanning the value range between surrounding non-null
values.null
和缺失值,这些值跨越周围非null
值之间的值范围。To determine the values for missing fields,要确定缺失字段的值,$linearFill
uses:$linearFill
使用:The difference of surrounding non-周围非null
values.null
值的差异。The number of要在周围值之间填充的空字段数。null
fields to fill between the surrounding values.
$linearFill
can fill multiple consecutive可以填充多个连续的null
values if those values are preceded and followed by non-null
values according to the sort order specified in$setWindowFields
.null
值,如果这些值前面和后面都是非null
值,则根据$setWindowFields
中指定的排序顺序。Example示例If a collection contains these documents:如果集合包含以下文档:{ index: 0, value: 0 },
{ index: 1, value: null },
{ index: 2, value: null },
{ index: 3, value: null },
{ index: 4, value: 10 }After using使用$linearFill
to fill thenull
values, the documents become:$linearFill
填充null
值后,文档变为:{ index: 0, value: 0 },
{ index: 1, value: 2.5 },
{ index: 2, value: 5 },
{ index: 3, value: 7.5 },
{ index: 4, value: 10 }前面和后面没有非null
values that are not preceded and followed by non-null
values remainnull
.null
值的null
值保持为null
。
Comparison of $fill
and $linearFill
$fill
和$linearFill
的比较
$fill
and $linearFill
To fill missing field values using linear interpolation要使用线性插值, you can use:
填充缺失的字段值,可以使用:
The带有$fill
stage with{ method: "linear" }
.{ method: "linear" }
的$fill
阶段。When you use the使用$fill
stage, the field you specify in the output is the same field used as the source data. See Fill Missing Field Values with Linear Interpolation.$fill
阶段时,在输出中指定的字段与源数据使用的字段相同。请参见使用线性插值填充缺少的字段值。The$linearFill
operator inside of a$setWindowFields
stage.$setWindowFields
阶段内的$linearFill
运算符。When you use the使用$linearFill
operator, you can set values for a different field than the field used as the source data. See Use Multiple Fill Methods in a Single Stage.$linearFill
运算符时,可以为不同于用作源数据的字段的字段设置值。请参见在单个阶段中使用多种填充方法。
Examples实例
The examples on this page use a 此页面上的示例使用了一个stock
collection that contains tracks a single company's stock price at hourly intervals:stock
集合,其中包含按小时间隔跟踪单个公司股价的信息:
db.stock.insertMany( [
{
time: ISODate("2021-03-08T09:00:00.000Z"),
price: 500
},
{
time: ISODate("2021-03-08T10:00:00.000Z"),
},
{
time: ISODate("2021-03-08T11:00:00.000Z"),
price: 515
},
{
time: ISODate("2021-03-08T12:00:00.000Z")
},
{
time: ISODate("2021-03-08T13:00:00.000Z")
},
{
time: ISODate("2021-03-08T14:00:00.000Z"),
price: 485
}
] )
The 集合中的某些文档缺少price
field is missing for some of the documents in the collection.price
字段。
Fill Missing Values with Linear Interpolation用线性插值填充缺失值
To populate the missing 要使用线性插值price
values using linear interpolation, use
$linearFill
inside of a $setWindowFields
stage:填充缺失的
price
值,请在$setWindowFields
阶段内使用$linearFill
:
db.stock.aggregate( [
{
$setWindowFields:
{
sortBy: { time: 1 },
output:
{
price: { $linearFill: "$price" }
}
}
}
] )
In the example:在示例中:
sortBy: { time: 1 }
sorts the documents by the按照time
field in ascending order, from earliest to latest.time
字段从最早到最晚的升序对文档进行排序。output
specifies:指定:price
as the field for which to fill in missing values.作为要填充缺失值的字段。{ $linearFill: "$price" }
as the value for the missing field.作为缺失字段的值。$linearFill
fills missing使用基于序列中周围price
values using linear interpolationbased on the surrounding
price
values in the sequence.price
值的线性插值来填充缺失的
price
值。
Example output:示例输出:
[
{
_id: ObjectId("620ad555394d47411658b5ef"),
time: ISODate("2021-03-08T09:00:00.000Z"),
price: 500
},
{
_id: ObjectId("620ad555394d47411658b5f0"),
time: ISODate("2021-03-08T10:00:00.000Z"),
price: 507.5
},
{
_id: ObjectId("620ad555394d47411658b5f1"),
time: ISODate("2021-03-08T11:00:00.000Z"),
price: 515
},
{
_id: ObjectId("620ad555394d47411658b5f2"),
time: ISODate("2021-03-08T12:00:00.000Z"),
price: 505
},
{
_id: ObjectId("620ad555394d47411658b5f3"),
time: ISODate("2021-03-08T13:00:00.000Z"),
price: 495
},
{
_id: ObjectId("620ad555394d47411658b5f4"),
time: ISODate("2021-03-08T14:00:00.000Z"),
price: 485
}
]
Use Multiple Fill Methods in a Single Stage在单个阶段中使用多种填充方法
When you use the 当您使用$setWindowFields
stage to fill missing values, you can set values for a different field than the field you fill from. $setWindowFields
阶段来填充缺失的值时,您可以为不同于从中填充的字段的字段设置值。As a result, you can use multiple fill methods in a single 因此,您可以在单个$setWindowFields
stage and output the results in distinct fields.$setWindowFields
阶段中使用多个填充方法,并在不同的字段中输出结果。
The following pipeline populates missing 以下管道使用线性插值price
fields using linear interpolation and the last-observation-carried-forward method:
和最后一次观测结转方法填充缺失的
price
字段:
db.stock.aggregate( [
{
$setWindowFields:
{
sortBy: { time: 1 },
output:
{
linearFillPrice: { $linearFill: "$price" },
locfPrice: { $locf: "$price" }
}
}
}
] )
In the example:在示例中:
sortBy: { time: 1 }
sorts the documents by the按照time
field in ascending order, from earliest to latest.time
字段从最早到最晚的升序对文档进行排序。output
specifies:指定:linearFillPrice
as a target field to be filled.作为要填充的目标字段。{ $linearFill: "$price" }
is the value for the是linearFillPrice
field.linearFillPrice
字段的值。$linearFill
fills missingprice
values using linear interpolationbased on the surrounding
price
values in the sequence.$linearFill
使用基于序列中周围price
值的线性插值来填充缺失的价格值。
locfPrice
as a target field to be filled.作为要填充的目标字段。{ $locf: "$price" }
is the value for the是locfPrice
field.locfPrice
字段的值。locf
stands for last observation carried forward.代表最后一次观测。$locf
fills missing用序列中上一个文档的值填充缺少的price
values with the value from the previous document in the sequence.price
值。
Example output:示例输出:
[
{
_id: ObjectId("620ad555394d47411658b5ef"),
time: ISODate("2021-03-08T09:00:00.000Z"),
price: 500,
linearFillPrice: 500,
locfPrice: 500
},
{
_id: ObjectId("620ad555394d47411658b5f0"),
time: ISODate("2021-03-08T10:00:00.000Z"),
linearFillPrice: 507.5,
locfPrice: 500
},
{
_id: ObjectId("620ad555394d47411658b5f1"),
time: ISODate("2021-03-08T11:00:00.000Z"),
price: 515,
linearFillPrice: 515,
locfPrice: 515
},
{
_id: ObjectId("620ad555394d47411658b5f2"),
time: ISODate("2021-03-08T12:00:00.000Z"),
linearFillPrice: 505,
locfPrice: 515
},
{
_id: ObjectId("620ad555394d47411658b5f3"),
time: ISODate("2021-03-08T13:00:00.000Z"),
linearFillPrice: 495,
locfPrice: 515
},
{
_id: ObjectId("620ad555394d47411658b5f4"),
time: ISODate("2021-03-08T14:00:00.000Z"),
price: 485,
linearFillPrice: 485,
locfPrice: 485
}
]
Restrictions限制
To use若要使用$linearFill
, you must use the sortBy field to sort your data.$linearFill
,必须使用sortBy
字段对数据进行排序。When using使用$linearFill
window function,$setWindowFields
returns an error if there are any repeated values in the sortBy field in a single partition.$linearFill
窗口函数时,如果单个分区中的sortBy
字段中有任何重复值,则$setWindowFields
将返回一个错误。