Monitor Presto easily with Grafana

Monitor your Presto deployment with Grafana Cloud’s out-of-the-box monitoring solution. Presto is an open source distributed SQL query engine designed for running interactive analytic queries against data sources of all sizes. The Grafana Cloud forever-free tier includes 3 users and up to 10k metric series to support your monitoring needs.

Key metrics included

jvm_gc_collection_count
jvm_gc_duration
jvm_heap_memory_committed
jvm_heap_memory_used
jvm_nonheap_memory_committed
jvm_nonheap_memory_used
presto_ClusterMemoryPool_general_BlockedNodes
presto_ClusterMemoryPool_general_FreeDistributedBytes
presto_ClusterMemoryPool_reserved_FreeDistributedBytes
presto_HeartbeatDetector_ActiveCount
presto_MemoryPool_general_FreeBytes
presto_MemoryPool_reserved_FreeBytes
presto_QueryExecution_Executor_QueuedTaskCount
presto_QueryManager_AbandonedQueries_OneMinute_Count
presto_QueryManager_AbandonedQueries_TotalCount
presto_QueryManager_CanceledQueries_OneMinute_Count
presto_QueryManager_CanceledQueries_TotalCount
presto_QueryManager_CompletedQueries_OneMinute_Count
presto_QueryManager_CompletedQueries_OneMinute_Rate
presto_QueryManager_ConsumedCpuTimeSecs_OneMinute_Count
presto_QueryManager_CpuInputByteRate_OneMinute_Total
presto_QueryManager_ExecutionTime_OneMinute_P50
presto_QueryManager_ExecutionTime_OneMinute_P75
presto_QueryManager_ExecutionTime_OneMinute_P95
presto_QueryManager_ExecutionTime_OneMinute_P99
presto_QueryManager_FailedQueries_OneMinute_Count
presto_QueryManager_FailedQueries_TotalCount
presto_QueryManager_InsufficientResourcesFailures_OneMinute_Rate
presto_QueryManager_InsufficientResourcesFailures_TotalCount
presto_QueryManager_InternalFailures_OneMinute_Count
presto_QueryManager_InternalFailures_OneMinute_Rate
presto_QueryManager_QueuedQueries
presto_QueryManager_RunningQueries
presto_QueryManager_StartedQueries_OneMinute_Count
presto_QueryManager_StartedQueries_OneMinute_Rate
presto_QueryManager_UserErrorFailures_OneMinute_Count
presto_QueryManager_UserErrorFailures_OneMinute_Rate
presto_TaskExecutor_ProcessorExecutor_CompletedTaskCount
presto_TaskExecutor_ProcessorExecutor_CorePoolSize
presto_TaskExecutor_ProcessorExecutor_PoolSize
presto_TaskExecutor_ProcessorExecutor_QueuedTaskCount
presto_TaskManager_FailedTasks_TotalCount
presto_TaskManager_InputDataSize_OneMinute_Rate
presto_TaskManager_OutputDataSize_OneMinute_Rate
presto_TaskManager_OutputPositions_OneMinute_Rate
presto_TaskManager_TaskNotificationExecutor_PoolSize
presto_metadata_DiscoveryNodeManager_ActiveCoordinatorCount
presto_metadata_DiscoveryNodeManager_ActiveNodeCount
presto_metadata_DiscoveryNodeManager_ActiveResourceManagerCount
presto_metadata_DiscoveryNodeManager_InactiveNodeCount
up
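
As a quick sanity check, it can help to confirm that a Presto exporter is actually exposing the metrics listed above. The following is a minimal sketch, assuming the metrics are available in the Prometheus text exposition format. The `EXPECTED_METRICS` subset, the `SAMPLE` payload, and both helper functions are illustrative assumptions, not part of the Grafana Cloud integration.

```python
# Hypothetical subset of the key metrics listed above; extend as needed.
EXPECTED_METRICS = {
    "presto_QueryManager_RunningQueries",
    "presto_QueryManager_QueuedQueries",
    "presto_QueryManager_FailedQueries_TotalCount",
    "jvm_heap_memory_used",
}


def metric_names(exposition_text):
    """Return the set of metric names found in Prometheus text-format output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        # The metric name ends at the first "{" (labels) or at whitespace (value).
        names.add(line.split("{", 1)[0].split()[0])
    return names


def missing_metrics(exposition_text, expected=EXPECTED_METRICS):
    """Return the expected metric names that are absent, sorted alphabetically."""
    return sorted(set(expected) - metric_names(exposition_text))


# Made-up sample payload, used only to exercise the functions above.
SAMPLE = """\
# TYPE presto_QueryManager_RunningQueries gauge
presto_QueryManager_RunningQueries 3
presto_QueryManager_QueuedQueries{instance="coordinator"} 0
jvm_heap_memory_used 1.2e9
"""

if __name__ == "__main__":
    print(missing_metrics(SAMPLE))
```

In practice the text would come from the exporter's metrics endpoint rather than a hard-coded string; the endpoint address depends on how the exporter is deployed.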

Key alerting rules included

PrestoHighInsufficientResources (Critical)
PrestoHighTaskFailuresWarning (Warning)
PrestoHighTaskFailuresCritical (Critical)
PrestoHighQueuedTaskCount (Warning)
PrestoHighBlockedNodes (Critical)
PrestoHighFailedQueriesWarning (Warning)
PrestoHighFailedQueriesCritical (Critical)
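
Several of the rules above come in Warning and Critical pairs. The sketch below illustrates the general idea of a two-tier threshold, using a one-minute failed-query count as input. The function name and the default thresholds (5 and 15) are hypothetical values chosen for illustration; the expressions and thresholds that ship with the integration are not stated here.

```python
def classify_failed_queries(failed_last_minute, warning_at=5, critical_at=15):
    """Map a one-minute failed-query count to a severity label.

    The default thresholds are illustrative assumptions, not the values
    used by the Grafana Cloud alert rules.
    """
    if warning_at > critical_at:
        raise ValueError("warning_at must not exceed critical_at")
    # Check the higher threshold first so a critical value is never
    # reported as a mere warning.
    if failed_last_minute >= critical_at:
        return "critical"
    if failed_last_minute >= warning_at:
        return "warning"
    return "ok"


if __name__ == "__main__":
    for count in (0, 5, 20):
        print(count, classify_failed_queries(count))
```

Checking the critical threshold before the warning threshold keeps the two tiers mutually exclusive, so a single reading maps to exactly one severity.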