The Internet of Things on AWS – Official Blog
Getting Aggregate Information of Devices with AWS IoT Device Management Fleet Indexing
AWS IoT Device Management Fleet Indexing has launched three APIs for getting aggregate information of devices: GetStatistics
, GetPercentiles
and GetCardinality
. With these APIs, you can quickly answer questions like: What’s the percentage of the devices connecting to AWS IoT? What’s the maximum, minimum and average of device battery level? How many OS versions are present?
In this blog post, we will walk you through a data center temperature monitoring simulation to understand how to use Fleet Indexing APIs to get aggregate information of the devices. A data center may have many sensors monitoring server temperatures. You will learn how to use GetStatistics
to get the min/max/avg temperature; use GetPercentiles
to get the 90th percentile of temperature; use GetCardinality
to get the number of server racks with abnormal temperatures.
Set Up
Ensure you have the latest version of the AWS CLI installed.
Assuming you are operating a data center with 10 temperature sensors connected to AWS IoT, distributed among 6 server racks. Some sensors are reporting abnormal temperatures. Let’s use the script below to create 10 things representing the sensors. Each thing will have three attributes:
- temperature: Number type – temperature value in Fahrenheit. Note that using thing attribute here is for simplicity of demonstration only. For best practices, please use AWS IoT Device Shadows for frequently changing fields.
- rackId: String type – id of the server rack where the sensor belongs. Ranging from Rack1 to Rack6.
- stateNormal: Boolean type – indicating if the temperature value is within normal range: [60 to 80]. Rack5 and Rack6 are in abnormal state.
# Bash script. If in other shells, type `bash` before running
Temperatures=(70 71 72 73 74 75 47 97 98 99)
Racks=(Rack1 Rack1 Rack2 Rack2 Rack3 Rack4 Rack5 Rack6 Rack6 Rack6)
IsNormal=(true true true true true true false false false false)
for ((i=0; i<10 ; i++))
do
thing=$(aws iot create-thing --thing-name "TempSensor$i" --attribute-payload attributes="{temperature=${Temperatures[i]},rackId=${Racks[i]},stateNormal=${IsNormal[i]}}")
aws iot describe-thing --thing-name "TempSensor$i"
done
To get aggregate information of these sensors, you need to enable Fleet Indexing and define what fields to aggregate with the UpdateIndexingConfiguration
API. In the sample code below, thing-indexing-configuration
defines how things are indexed.
thingIndexingMode
: One of “OFF”|”REGISTRY”|”REGISTRY_AND_SHADOW”. Here REGISTRY means Fleet Indexing will only index the registry data of a thing.customFields
: a list of field name and type pairs. In order to perform aggregation queries on a field, we have to include the field name and type, when setting up the indexing configuration. However, this does not apply to AWS IoT managed fields such asthingName
,thingId
andregistry.version
. Managed fields support aggregation queries by default.
aws iot update-indexing-configuration --thing-indexing-configuration "thingIndexingMode=REGISTRY,customFields=[{name=attributes.temperature,type=Number},{name=attributes.rackId,type=String},{name=attributes.stateNormal,type=Boolean}]"
After turning on indexing, Fleet Indexing will create an index called AWS_Things
. The index will first be in the BUILDING
or REBUILDING
state. After the existing things are indexed, the index status will flip to ACTIVE
and be ready for queries. With 10 things, your index should be ACTIVE
within seconds to a minute. You can check the index status with the following command
aws iot describe-index --index-name "AWS_Things"
Result:
{
"indexName": "AWS_Things",
"indexStatus": "ACTIVE",
"schema": "REGISTRY"
}
Perform Aggregation Queries
Once the index is ACTIVE
, you can use GetStatistics
API to monitor the statistics of all temperature sensors.
aggregation-field
specifies which field to aggregate. Note, only custom fields defined withUpdateIndexingConfiguration
or AWS IoT managed fields can be aggregated.query-string
defines a filter on things matching the query criteria.
aws iot get-statistics --aggregation-field "attributes.temperature" --query-string "thingName:TempSensor*"
Result:
{
"statistics": {
"count": 10,
"average": 77.6,
"sum": 776.0,
"minimum": 47.0,
"maximum": 99.0,
"sumOfSquares": 62578.0,
"variance": 236.04000000000013,
"stdDeviation": 15.36359332968691
}
}
The max and min temperatures are 99 and 47, which indicates some sensors are reading anomalies. To better understand the distribution, you can leverage GetPercentiles
. This feature returns a list of probability and non-exceedance field value pairs. For example, in the following result you have {percent:90.0, value:98.1}
. This means 90% of the temperature values are below 98.1. The input percentages defines a list of percentiles to return. Notice GetPercentiles
gives approximation results. Larger aggregation set leads to more accurate estimations.
aws iot get-percentiles --aggregation-field "attributes.temperature" --query-string "thingName:TempSensor*" --percents 10 25 50 75 90
Result:
{
"percentiles": [
{
"percent": 10.0,
"value": 67.7
},
{
"percent": 25.0,
"value": 71.25
},
{
"percent": 50.0,
"value": 73.5
},
{
"percent": 75.0,
"value": 91.75
},
{
"percent": 90.0,
"value": 98.1
}
]
}
Combining the two responses above, you can conclude a few sensors are detecting anomalies. Among them, most are reporting high temperatures. With GetCardinality
, you can quickly check if the temperature problem is happening at one location (in one or two racks) or across the entire data center (all racks). Same as GetPercentiles
, the results are estimations and accuracy improves over the scale. The result below means only two racks are having abnormal temperatures.
aws iot get-cardinality --aggregation-field "attributes.rackId" --query-string "thingName:TempSensor* AND attributes.stateNormal:false"
Result:
{
"cardinality": 2
}
Among all the racks, let’s assume you know Rack6 has more servers than others. You can quickly look at the status of Rack6 using GetStatistics
. From the result below, you can confirm Rack6 is too hot as the average temperature is 98.0.
aws iot get-statistics --aggregation-field "attributes.temperature" --query-string "thingName:TempSensor* AND attributes.rackId:Rack6"
Result:
{
"statistics": {
"count": 3,
"average": 98.0,
"sum": 294.0,
"minimum": 97.0,
"maximum": 99.0,
"sumOfSquares": 28814.0,
"variance": 0.6666666666666666,
"stdDeviation": 0.816496580927726
}
}
After mitigating the problem, you can investigate the low temperature value of 47 with another Fleet Indexing API: SearchIndex
. This API returns all the things satisfying the query string. Based on the result below, you know the low value is from TempSensor6 and actions are needed at Rack5.
aws iot search-index --query-string "thingName:TempSensor* AND attributes.temperature<60"
Result:
{
"things": [
{
"thingName": "TempSensor6",
"thingId": "45b45d4e-ad95-495f-828e-47df5454ab7c",
"attributes": {
"rackId": "Rack5",
"stateNormal": "false",
"temperature": "47"
}
}
]
}
Clean Up
If you no longer need the resources in the example, the following script can help to delete all the things.
for ((i=0; i<10; i++))
do
aws iot delete-thing --thing-name "TempSensor$i"
done
It is recommended to keep Fleet Indexing enabled so the devices are always ready for queries. Optionally, you can turn thing indexing off with the following command.
aws iot update-indexing-configuration --thing-indexing-configuration thingIndexingMode=OFF
Conclusion
We have demonstrated how to use AWS IoT Device Management Fleet Indexing APIs to perform aggregation queries on devices. These APIs also work for thing group aggregations. The prerequisite is to enable Fleet Indexing and define the fields you want to aggregate with the UpdateIndexingConfiguration
API.
While performing aggregation queries, you use
query-string
to filter in the things matching the query criteria.aggregation-field
to specify what field to aggregate. Only custom fields defined withUpdateIndexingConfiguration
or AWS IoT managed fields can be aggregated.
Given a query and a field
GetStatistics
returns basic statistics (count/avg/sum/min/max etc) of the field value.GetPercentiles
returns an estimated list of percentile, field value pairs. Each pair indicates the percentile probability of a thing field value not exceeding the given field value.GetCardinality
returns an estimated count of unique field values.
Learn More
Fleet Indexing is a feature within AWS IoT Device Management. Apart from querying for aggregate data, there is much more that AWS IoT Device Management offers to register, organize, monitor, and remotely manage connected devices at scale.
AWS IoT Fleet Indexing Documentation
https://docs.aws.amazon.com/iot/latest/developerguide/iot-indexing
AWS IoT Device Management
https://aws.amazon.com/iot-device-management
AWS IoT Device Management Features
https://aws.amazon.com/iot-device-management/features