Metrics Families

A metrics family is a group of metrics that are exposed by a single collector on every scrape.

This document lists all of the metrics exported by lightbox-exporter, together with their labels. Prometheus may also be configured to attach additional labels to each metric, such as job or instance. For details, see https://prometheus.io/docs/concepts/jobs_instances/.
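
To illustrate how target labels such as job and instance end up next to the exporter's own labels, here is a minimal Python sketch of the merge that Prometheus performs at scrape time. The function name and all label values are hypothetical. The conflict handling mirrors the documented Prometheus behavior: a clashing scraped label is renamed to exported_<name> unless honor_labels is enabled.

```python
def attach_target_labels(exported, target, honor_labels=False):
    """Merge labels exposed by the exporter with Prometheus target labels.

    exported: labels found on the scraped sample (e.g. server_id, node_id).
    target:   labels Prometheus attaches to the scrape target (e.g. job, instance).
    """
    merged = dict(exported)
    for name, value in target.items():
        if name in merged:
            if honor_labels:
                continue  # keep the value that came from the exporter
            # Default behavior: preserve the scraped value under exported_<name>.
            merged["exported_" + name] = merged[name]
        merged[name] = value
    return merged


# Hypothetical values, for illustration only.
series_labels = attach_target_labels(
    {"server_id": "server-0", "node_id": "node-0"},
    {"job": "lightbox", "instance": "10.0.0.1:8090"},
)
```

Queries and dashboards that select by node_id or server_id keep working regardless of which target labels are added, because the exporter's labels are left untouched unless a name collides.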

Clustering

Collects information on Lightbits resources. The metrics expose information such as volume state, volume usage, node state, and node usage.

| Name | Type | Comments | Labels |
| --- | --- | --- | --- |
| lightbox_clustering_node_info | Gauge | Node information. The following values are exported via the metric's labels: Lightbits version, node UUID, server UUID. | server_id, node_id, node_lightos_version |
| lightbox_clustering_num_connected_hosts | Gauge | Number of hosts connected to the cluster. | - |
| lightbox_clustering_volume_logical_used_storage | Gauge | Logical storage space used by the volume (total logical address space used by this volume), given in bytes. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_physical_used_storage | Gauge | Physical storage space used by the volume, given in bytes. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_rebuild_progress | Gauge | Rebuild progress of the volume's replicated data, given as a percentage. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_state_fully_protected | Gauge | Volume is in the fully protected state. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_state_degraded | Gauge | Volume is in the degraded state. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_state_read_only | Gauge | Volume is in the read-only state. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_state_not_available | Gauge | Volume is in the not available state. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_migrating | Gauge | A volume replica is being migrated. | server_id, node_id, volume_id, volume_name, nsid, project_id |
| volume_compression_ratio | Gauge | Volume's compression ratio: logical used (data written) / physical used. | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_physical_owned_capacity | Gauge | The capacity that would be freed when the volume is deleted (has the same value as lightbox_clustering_volume_physical_capacity). | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_physical_capacity | Gauge | The capacity used by this volume layer (physical capacity since the last snapshot/rollback operation on the volume). | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_physical_owned_memory | Gauge | The amount of memory that would be freed by deleting this volume (has the same value as lightbox_clustering_volume_physical_memory). | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_physical_memory | Gauge | Memory used by this volume layer (memory used since the last snapshot/rollback operation on the volume). | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_volume_user_written | Gauge | The amount of user data written in this layer (data written since the last snapshot/rollback operation on the volume). | server_id, node_id, volume_id, volume_name, nsid, project_id, project_name |
| lightbox_clustering_snapshot_physical_owned_capacity | Gauge | Capacity that would be freed when this snapshot is deleted (some of the capacity used by the snapshot, as exposed by lightbox_clustering_snapshot_physical_capacity, may still be used by the volume or by newer snapshots, so it is not freed when the snapshot is deleted). | node_id, snapshot_id, snapshot_name, src_volume_uuid |
| lightbox_clustering_snapshot_physical_capacity | Gauge | The physical storage used by this snapshot. | node_id, snapshot_id, snapshot_name, src_volume_uuid |
| lightbox_clustering_snapshot_physical_owned_memory | Gauge | The amount of memory that would be freed by deleting this snapshot (some of the memory used by the snapshot, as exposed by lightbox_clustering_snapshot_physical_memory, may still be used by the volume or by newer snapshots, so it is not freed when the snapshot is deleted). | node_id, snapshot_id, snapshot_name, src_volume_uuid |
| lightbox_clustering_snapshot_physical_memory | Gauge | The memory used by this snapshot. | node_id, snapshot_id, snapshot_name, src_volume_uuid |
| lightbox_clustering_snapshot_user_written | Gauge | Amount of user data stored in this snapshot (before compression). | node_id, snapshot_id, snapshot_name, src_volume_uuid |
| lightbox_clustering_node_num_managed_devices | Gauge | Number of NVMe SSD devices managed by the node. | server_id, node_id |
| lightbox_clustering_node_local_rebuild_progress | Gauge | Node-local EC rebuild progress, given as a percentage. | server_id, node_id |
| lightbox_clustering_node_rebuild_from_peer | Gauge | Node is recovering replicated data. | server_id, node_id |
| lightbox_clustering_node_logical_used_storage | Gauge | Logical storage space used (data written), given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_physical_used_storage_including_parity | Gauge | Physical storage space occupied by all data, including EC overhead, given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_managed_physical_storage | Gauge | Total capacity of all managed and healthy NVMe SSD devices, given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_physical_used_storage | Gauge | Physical storage space occupied by all volumes, given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_free_physical_storage | Gauge | Available free physical storage capacity, given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_effective_physical_storage | Gauge | Effective physical storage, excluding the overhead of EC and OVP, given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_estimated_free_logical_storage | Gauge | Estimated available free logical (data written) storage capacity, given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_compression_ratio | Gauge | Compression ratio: logical used (data written) / physical used. | server_id, node_id, node_name |
| lightbox_clustering_node_estimated_logical_storage | Gauge | Node's estimated logical available storage (node effective physical storage x compression ratio). | server_id, node_id, node_name |
| lightbox_clustering_node_power_up_after_abrupt_shutdown | Gauge | Node power-up that follows an abrupt shutdown of the server or of the node-manager service. | server_id, node_id |
| lightbox_clustering_node_powerup_progress | Gauge | Progress of the node-manager service power-up, given as a percentage. | server_id, node_id |
| lightbox_clustering_node_state_inactive | Gauge | Node state is Inactive. | server_id, node_id |
| lightbox_clustering_node_state_active | Gauge | Node state is Active. | server_id, node_id |
| lightbox_clustering_node_read_only_mode | Gauge | Node is in the Read Only state. | server_id, node_id, node_name |
| lightbox_clustering_node_logical_storage | Gauge | Sum of the sizes of all volumes provisioned (allocated) on the node, given in bytes. | server_id, node_id, node_name |
| lightbox_clustering_node_unrecoverable_data_integrity_errors | Counter | The number of unrecoverable data integrity errors experienced by the node. | server_id, node_id, node_name |
| lightbox_clustering_node_recoverable_data_integrity_errors | Counter | The number of recoverable data integrity errors experienced by the node. | server_id, node_id, node_name |
| lightbox_clustering_ssd_unrecoverable_data_integrity_errors | Counter | The number of unrecoverable data integrity errors on the SSD. | server_id, node_id, node_name, serial_number |
| lightbox_clustering_ssd_recoverable_data_integrity_errors | Counter | The number of recoverable data integrity errors on the SSD. | server_id, node_id, node_name, serial_number |
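
The two derived quantities described in the table, the compression ratio (logical used / physical used) and the node's estimated logical storage (effective physical storage x compression ratio), can be worked through with a short Python sketch. The function names and the sample byte counts are hypothetical and only illustrate the arithmetic; the exporter reports these values itself.

```python
TIB = 2**40  # bytes in one tebibyte


def compression_ratio(logical_used_bytes, physical_used_bytes):
    """Logical used (data written) divided by physical used."""
    if physical_used_bytes == 0:
        return None  # ratio is undefined when nothing is stored
    return logical_used_bytes / physical_used_bytes


def estimated_logical_storage(effective_physical_bytes, ratio):
    """Effective physical storage multiplied by the compression ratio."""
    return effective_physical_bytes * ratio


# Hypothetical node: 6 TiB written by users, stored in 3 TiB of physical space,
# with 10 TiB of effective physical storage.
ratio = compression_ratio(6 * TIB, 3 * TIB)
estimate_bytes = estimated_logical_storage(10 * TIB, ratio)
```

With these sample numbers the ratio is 2.0, so the node is estimated to hold 20 TiB of logical data.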

cpufreq

Exposes kernel/system CPU frequency.

| Name | Type | Comments | Labels |
| --- | --- | --- | --- |
| lightbox_cpu_frequency_hertz | Gauge | Current CPU thread frequency, in hertz. | cpu |

datapath

Exposes information from the datapath components (Lightbits GFTL).

| Name | Type | Comments | Labels |
| --- | --- | --- | --- |
| lightbox_be_config_md_ram_capacity_used | Counter | GFTL metadata RAM usage (in equivalent storage capacity: MD usage is expressed as the equivalent amount of data that can be stored for this amount of MD usage). | node_id, server_id |
| lightbox_be_config_md_ram_capacity_limit | Counter | GFTL metadata RAM limit (in equivalent storage capacity: the MD limit is expressed as the equivalent maximum amount of data that can be stored, as limited by MD usage). | node_id, server_id |
| lightbox_be_poll_write_cmds | Counter | Number of commands the backend wrote. | core_id, node_id, server_id |
| lightbox_be_poll_objects | Counter | Number of objects the backend wrote. | core_id, node_id, server_id |
| lightbox_be_num_bio_error | Counter | Number of inflight read requests. | core_id, node_id, server_id |
| lightbox_be_num_bio_error_reconstruct | Counter | Number of inflight read requests. | core_id, node_id, server_id |
| lightbox_be_bio_error_with_lf_bypass | Counter | Number of inflight read requests. | core_id, node_id, server_id |
| lightbox_be_inflight_read_requests | Counter | Number of inflight read requests. | core_id, node_id, server_id |
| lightbox_be_bios_per_request | Counter | Number of block I/Os per request. | core_id, node_id, server_id |
| lightbox_be_compress_ratio_sum | Counter | Compression ratio total. | core_id, node_id, server_id |
| lightbox_be_compress_ratio_count | Counter | Compression ratio count. | core_id, node_id, server_id |
| lightbox_be_num_of_gc_units | Counter | Number of write units written by GC since boot (counter). | core_id, node_id, server_id |
| lightbox_be_num_of_user_units | Counter | Number of write units of user data written since boot (counter). | core_id, node_id, server_id |
| lightbox_be_write_units_start | Counter | Number of write units started. | core_id, node_id, server_id |
| lightbox_be_write_units_end | Counter | Number of write units ended. | core_id, node_id, server_id |
| lightbox_be_idle_cycles | Gauge | GFTL idle cycles. | core_id, node_id, role, server_id |
| lightbox_be_busy_cycles | Gauge | GFTL busy cycles. | core_id, node_id, role, server_id |
| lightbox_be_num_resets | Counter | Number of NVMe SSD device resets. | device_name |
| lightbox_be_read_errors | Counter | Number of NVMe SSD device read errors. | core_id, device_name |
| lightbox_be_write_errors | Counter | Number of NVMe SSD device write errors. | core_id, device_name |
| lightbox_be_add_disk_success | Counter | Number of successfully added NVMe SSD devices. | node_id, server_id |
| lightbox_be_add_disk_failure | Counter | Number of failures to add NVMe SSD devices. | node_id, server_id |
| lightbox_be_num_disk_failures | Counter | Number of NVMe SSD device failures. | node_id, server_id |
| lightbox_be_num_of_gc_skips | Counter | Number of GC skips. | core_id, node_id, server_id |
| lightbox_be_write_unit_duration_sum | Counter | Write unit count: number of write units written since boot (counter); this includes both user and GC write units. | core_id, node_id, server_id |
| lightbox_be_read_storage_duration_sum | Counter | Read storage duration sum. | core_id, node_id, server_id |
| lightbox_be_read_storage_duration_count | Counter | Read storage duration count. | core_id, node_id, server_id |
| lightbox_be_no_available_lf_cmds | Counter | Number of unavailable lf commands. | core_id, node_id, server_id |
| lightbox_be_write_buffer_allocation_failure | Counter | Number of write buffer allocation failures. | core_id, node_id, server_id |
| lightbox_be_write_buffer_allocation_success | Counter | Number of write buffer allocation successes. | core_id, node_id, server_id |
| lightbox_be_num_blkdev_threshold_reached | Counter | Number of inflight read requests. | core_id, node_id, server_id |
| lightbox_be_inflight_4k_requests_irate | Counter | Number of inflight read requests. | core_id, node_id, server_id |
| lightbox_be_total_units | Counter | Total units already recovered. | node_id, server_id |
| lightbox_be_num_processed_storage_scheme_units | Counter | Number of processed units. | node_id, server_id |
| lightbox_be_num_completed_units | Counter | Number of completed recovery units. | node_id, server_id |
| lightbox_be_num_units_to_recover | Counter | Number of units to be recovered. | node_id, server_id |
| lightbox_be_effective_write_unit_size | Counter | Write unit size. | node_id, server_id |
| lightbox_be_cherry_picked_units | Counter | Number of write units cherry-picked by GC. | core_id, node_id, server_id |
| lightbox_be_oldest_generation | Counter | Generation of the oldest write unit. | core_id, node_id, server_id |
| lightbox_be_oldest_valid_generation | Counter | Generation of the next write unit to be cherry-picked. | core_id, node_id, server_id |
| lightbox_be_writer_effective_position | Counter | Generation of the newest write unit. | core_id, node_id, server_id |
| lightbox_be_processed_delete_objects | Counter | Delete objects processed by writer_lf. | core_id, node_id, server_id |
| lightbox_be_delete_objs_not_canceled | Counter | Number of delete objects not canceled. | core_id, node_id, server_id |
| lightbox_be_delete_total_blocks_not_canceled | Counter | Number of delete 4K blocks not canceled. | core_id, node_id, server_id |
| lightbox_be_remove_delete_objects | Counter | Number of discarded trim tombstones. | core_id, node_id, server_id |
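
Several datapath metrics come as a _sum/_count pair, for example lightbox_be_read_storage_duration_sum and lightbox_be_read_storage_duration_count. The sketch below assumes that these pairs follow the usual Prometheus convention, where the mean over an interval is the increase of the sum divided by the increase of the count. This document does not state that convention or the unit of the duration, so both are assumptions, and the function name and sample values are hypothetical.

```python
def mean_between_scrapes(sum_prev, count_prev, sum_curr, count_curr):
    """Mean value of the samples recorded between two scrapes.

    Returns None when no new samples were recorded, or when either counter
    went backwards (for example after a service restart).
    """
    delta_sum = sum_curr - sum_prev
    delta_count = count_curr - count_prev
    if delta_count <= 0 or delta_sum < 0:
        return None
    return delta_sum / delta_count


# Hypothetical readings from two consecutive scrapes of one core.
mean_duration = mean_between_scrapes(
    sum_prev=120.0, count_prev=40, sum_curr=150.0, count_curr=50
)
```

Here 10 new samples added 30.0 to the sum, so the mean over the interval is 3.0 in whatever unit the exporter uses.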

duroslight

Exposes information from the Duroslight service (Lightbits FE).

NameTypeCommentsLabels
lightbox_alien_receive_batch_queue_lengthGaugeCurrent received batch queue length.nodeid, server id, shard
lightbox_alien_total_received_messagesCounterTotal number of received messages.nodeid, server id, shard
lightbox_alien_total_sent_messagesCounterTotal number of sent messages.nodeid, server id, shard
lightbox_fe_compare_handler_nr_compare_cmdsCounterTotal number of compare commands.core_id, node_id, server_id, shard
lightbox_fe_compare_handler_nr_compare_mismatchCounterNumber of objects compare mismatch.core_id, node_id, server_id, shard
lightbox_fe_compare_handler_nr_compare_mismatch_small_objCounterNumber of small objects compare mismatch.core_id, node_id, server_id, shard
lightbox_fe_compare_handler_nr_compare_mismatch_wbCounterNumber of objects compare mismatch served from the write-buffer.core_id, node_id, server_id, shard
lightbox_fe_compare_handler_nr_compare_obj_hit_wbCounterNumber of compare objects served from the write-buffer.core_id, node_id, server_id, shard
lightbox_fe_failed_abort_commandsCounterNumber of failed abort commands.core_id, node_id, server_id, shard
lightbox_fe_incoming_protocol_log_page_reqsGaugeNumber of log page requests.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_log_page_reqs_endGaugeNumber of log page requests, ended.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_log_page_reqs_startGaugeNumber of log page requests, started.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_log_page_trim_reqsGaugeNumber of TRIM log page requests.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_log_page_trim_reqs_endGaugeNumber of TRIM log page requests, ended.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_log_page_trim_reqs_startGaugeNumber of TRIM log page requests, started.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_pending_packetsGaugeNumber of pending packets.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_pending_packets_endGaugeNumber of pending packets, ended.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_pending_packets_startGaugeNumber of pending packets, started.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_read_objsGaugeNumber of read objects.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_read_objs_endGaugeNumber of read objects, ended.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_read_objs_startGaugeNumber of read objects, started.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_read_reqsGaugeNumber of read requests.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_read_reqs_endGaugeNumber of read requests, ended.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_read_reqs_startGaugeNumber of read requests, started.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_reads_missing_nsidCounterNumber of rebuild reads missing nsid.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_replication_errorsGaugeNumber of replication errors.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_replication_requestsGaugeNumber of replication requests.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_replication_requests_endGaugeNumber of replication requests, ended.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_incoming_protocol_replication_requests_startGaugeNumber of replication requests, started.core_id, counter, node_id, remote_addr, server_id, shard
lightbox_fe_limiter_outstanding_read_requestsGaugeNumber of limiter outstanding read requests.core_id, node_id, server_id, shard
lightbox_fe_limiter_outstanding_write_requestsGaugeNumber of limiter outstanding write requests..core_id, node_id, server_id, shard
lightbox_fe_limiter_read_timeoutsCounterNumber of limiter read timeouts.core_id, node_id, server_id, shard
lightbox_fe_limiter_write_timeoutsCounterNumber of limiter write timeouts.core_id, node_id, server_id, shard
lightbox_fe_n_replicator_trim_requests_rxCounterNumber of replicator trim commands per tx.core_id, node_id, server_id, shard
lightbox_fe_n_replicator_write_requests_rxCounterNumber of replicator write requests per rx.core_id, node_id, server_id, shard
lightbox_fe_n_replicator_write_requests_txCounterNumber of replicator write requests per tx.core_id, node_id, server_id, shard
lightbox_fe_n_volume_replicator_write_requests_rxGaugeNumber of replicator write requests rx per volume.core_id, node_id, server_id, shard
lightbox_fe_n_volume_replicator_write_requests_txGaugeNumber of replicator write requests tx per volume.core_id, node_id, server_id, shard
lightbox_fe_nr_fuse_requestsCounterNumber of fuse requests.core_id, node_id, server_id, shard
lightbox_fe_nr_read_bytes_0_1KGaugeRead bytes for 0-1K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_read_bytes_1K_8KGaugeRead bytes for 1K-8K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_read_bytes_32K_128KGaugeRead bytes for 32K-128K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_read_bytes_8K_32KGaugeRead bytes for 8K-32K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_read_requests_0_1KGaugeRead requests of 0-1K.core_id, node_id, server_id, shard
lightbox_fe_nr_read_requests_1K_8KGaugeRead requests of 1K-8K.core_id, node_id, server_id, shard
lightbox_fe_nr_read_requests_32K_128KGaugeRead requests of 32K-128K.core_id, node_id, server_id, shard
lightbox_fe_nr_read_requests_8K_32KGaugeRead requests of 8K-32K.core_id, node_id, server_id, shard
lightbox_fe_nr_write_bytes_0_1KGaugeWrite bytes for 0-1K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_write_bytes_1K_8KGaugeWrite bytes for 1K-8K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_write_bytes_32K_128KGaugeWrite bytes for 32K-128K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_write_bytes_8K_32KGaugeWrite bytes for 8K-32K requests.core_id, node_id, server_id, shard
lightbox_fe_nr_write_requests_0_1KGaugeWrite requests of 0-1K.core_id, node_id, server_id, shard
lightbox_fe_nr_write_requests_1K_8KGaugeWrite requests of 1K-8K.core_id, node_id, server_id, shard
lightbox_fe_nr_write_requests_32K_128KGaugeWrite requests of 32K-128K.core_id, node_id, server_id, shard
lightbox_fe_nr_write_requests_8K_32KGaugeWrite requests of 8K-32K.core_id, node_id, server_id, shard
lightbox_fe_num_tcp_connectionsGaugeTotal number of TCP connections.core_id, node_id, server_id, shard
lightbox_fe_outgoing_protocol_canceled_acked_replication_requestsCounterNumber of cancelled acked replication requests.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_canceled_acked_sync_requestsCounterNumber of cancelled acked sync requests.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_canceled_replication_requestsCounterNumber of cancelled replication requests.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_canceled_requests_pending_persistence_in_storageCounterNumber of cancelled requests persistent in storagecore_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_canceled_sync_requestsCounterNumber of cancelled sync requests.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_log_page_reqsGaugeNumber of log page requests.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_log_page_reqs_endGaugeNumber of log page requests, ended.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_log_page_reqs_startGaugeNumber of log page requests, started.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_read_reqsGaugeNumber of read requests.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_read_reqs_endGaugeNumber of read requests, ended.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_read_reqs_startGaugeNumber of read requests, started.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_replication_reqsGaugeNumber of replication requests.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_replication_reqs_endGaugeNumber of replication requests, ended.core_id, count, node_id, server_id, shard
lightbox_fe_outgoing_protocol_replication_reqs_startGaugeNumber of replication requests, started.core_id, count, node_id, server_id, shard
lightbox_fe_read_handler_read_cmdsGaugeNumber of inflight read commands.core_id, node_id, server_id, shard
lightbox_fe_read_handler_resubmit_read_cmdsCounterNumber of resubmitted read commands.core_id, node_id, server_id, shard
lightbox_fe_read_modify_write_handler_nr_4K_aligned_objsCounterNumber of 4k aligned rmw objects.core_id, node_id, server_id, shard
lightbox_fe_read_modify_write_handler_nr_merged_objsCounterNumber of rmw merged objects.core_id, node_id, server_id, shard
lightbox_fe_read_modify_write_handler_nr_read_failed_objsCounterNumber of failed reads on rmw objects.core_id, node_id, server_id, shard
lightbox_fe_read_modify_write_handler_nr_read_modify_objsCounterNumber of rmw objects.core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_0_1K_avgGaugeRead requests for 0-1K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_0_1K_countGaugeRead requests for 0-1K size sample count.core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_0_1K_sumGaugeRead requests for 0-1K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_1K_8K_avgGaugeRead requests for 1K-8K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_1K_8K_countGaugeRead requests for 1K-8K size sample count.core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_1K_8K_sumGaugeRead requests for 1K-8K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_32K_128K_avgGaugeRead requests for 32K-128K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_32K_128K_countGaugeRead requests for 32K-128K size sample count.core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_32K_128K_sumGaugeRead requests for 32K-128K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_8K_32K_avgGaugeRead requests for 8K-32K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_8K_32K_countGaugeRead requests for 8K-32K size sample count.core_id, node_id, server_id, shard
lightbox_fe_read_request_duration_8K_32K_sumGaugeRead requests for 8K-32K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_replicator_write_bandwidth_rxCounterNumber of replicator write bytes per rx.core_id, node_id, server_id, shard
lightbox_fe_replicator_write_bandwidth_txCounterNumber of replicator write bytes per tx.core_id, node_id, server_id, shard
lightbox_fe_retries_after_version_errorCounterNumber of replicator retries after version error.core_id, node_id, server_id, shard
lightbox_fe_seastar_current_loadGaugeCurrent load in this shard, in percentage.core_id, node_id, server_id, shard
lightbox_fe_seastar_memory_allocated_memoryGaugeAmount of allocated memory for the core.core_id, node_id, server_id, shard
lightbox_fe_seastar_memory_cross_cpu_freesCounterNumber of cross-CPU frees.core_id, node_id, server_id, shard
lightbox_fe_seastar_memory_free_memoryGaugeAmount of free memory for core.core_id, node_id, server_id, shard
lightbox_fe_seastar_memory_large_allocationsCounterNumber of large memory allocations.core_id, node_id, server_id, shard
lightbox_fe_seastar_memory_live_objectsGaugeNumber of live objects.core_id, node_id, server_id, shard
lightbox_fe_seastar_memory_reclaimsCounterNumber of memory reclaims.core_id, node_id, server_id, shard
lightbox_fe_seastar_memory_total_memoryCounterTotal amount of memory for the core.core_id, node_id, server_id, shard
lightbox_fe_seastar_total_pollsCounterTotal number of shard polls.core_id, node_id, server_id, shard
lightbox_fe_seastar_total_tasksCounterTotal number of shard tasks.core_id, node_id, server_id, shard
lightbox_fe_semaphore_service_allocated_units_rGaugeAllocated number of read sem units.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_allocated_units_wGaugeAllocated number of write sem units.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_req_units_rGaugeNumber of inflight read sem waits.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_req_units_wGaugeNumber of inflight write sem waits.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_total_units_rGaugeTotal number of read sem units.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_total_units_wGaugeTotal number of write sem units.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_wait_count_rGaugeRead sem wait count.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_wait_count_wGaugeWrite sem wait count.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_wait_time_rGaugeRead sem wait time.core_id, name, node_id, server_id, shard
lightbox_fe_semaphore_service_wait_time_wGaugeWrite sem wait time.core_id, name, node_id, server_id, shard
lightbox_fe_stage_buffer_objects_insertedCounterNumber of objects inserted.core_id, node_id, server_id, shard
lightbox_fe_stage_buffer_objects_overrideCounterNumber of objects overridden.core_id, node_id, server_id, shard
lightbox_fe_stage_buffer_objects_removedCounterNumber of objects removed.core_id, node_id, server_id, shard
lightbox_fe_success_abort_commandsCounterNumber of successful abort commands.core_id, node_id, server_id, shard
lightbox_fe_trim_request_duration_countCounterTotal number of aggregate TRIM requests.core_id, node_id, server_id, shard
lightbox_trim_request_duration_sumCounterNumber of TRIM requests.core_id, node_id, server_id, shard
lightbox_fe_volume_read_qos_wait_countGaugeRead request count for qos volume.nodeid, project id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_qos_wait_timeGaugeRead request qos wait time (microseconds).nodeid, project id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_0_1K_avgGaugeVolume read requests for 0-1K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_0_1K_countGaugeVolume read requests for 0-1K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_0_1K_sumGaugeVolume read requests for 0-1K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_1K_8K_avgGaugeVolume read requests for 1K-8K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_1K_8K_countGaugeVolume read requests for 1K-8K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_1K_8K_sumGaugeVolume read requests for 1K-8K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_32K_128K_avgGaugeVolume read requests for 32K-128K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_32K_128K_countGaugeVolume read requests for 32K-128K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_32K_128K_sumGaugeVolume read requests for 32K-128K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_8K_32K_avgGaugeVolume read requests for 8K-32K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_8K_32K_countGaugeVolume read requests for 8K-32K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_request_duration_8K_32K_sumGaugeVolume read requests for 8K-32K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_requests_0_1KGaugeRead requests for 0-1K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_requests_1K_8KGaugeRead requests for 1K-8K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_requests_32K_128KGaugeRead requests for 32K-128K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_requests_8K_32KGaugeRead requests for 8K-32K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_total_0_1KGaugeRead bytes for 0-1K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_total_1K_8KGaugeRead bytes for 1K-8K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_total_32K_128KGaugeRead bytes for 32K-128K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_read_total_8K_32KGaugeRead bytes for 8K-32K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_replicator_write_bandwidth_rxGaugereplicator_write_bandwidth_rxnode_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_replicator_write_bandwidth_txGaugereplicator_write_bandwidth_txnode_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_qos_wait_countGaugeWrite request count for qos volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_qos_wait_timeGaugeWrite request qos wait time (microseconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_0_1K_avgGaugeVolume write requests for 0-1K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_0_1K_countGaugeVolume write requests for 0-1K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_0_1K_sumGaugeVolume write requests for 0-1K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_1K_8K_avgGaugeVolume write requests for 1K-8K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_1K_8K_countGaugeVolume write requests for 1K-8K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_1K_8K_sumGaugeVolume write requests for 1K-8K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_32K_128K_avgGaugeVolume write requests for 32K-128K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_32K_128K_countGaugeVolume write requests for 32K-128K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_32K_128K_sumGaugeVolume write requests for 32K-128K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_8K_32K_avgGaugeVolume write requests for 8K-32K size average duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_8K_32K_countGaugeVolume write requests for 8K-32K size count.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_request_duration_8K_32K_sumGaugeVolume write requests for 8K-32K size total duration (seconds).node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_requests_0_1KGaugeWrite requests for 0-1K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_requests_1K_8KGaugeWrite requests for 1K-8K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_requests_32K_128KGaugeWrite requests for 32K-128K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_requests_8K_32KGaugeWrite requests for 8K-32K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_total_0_1KGaugeWrite bytes for 0-1K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_total_1K_8KGaugeWrite bytes for 1K-8K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_total_32K_128KGaugeWrite bytes for 32K-128K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_volume_write_total_8K_32KGaugeWrite bytes for 8K-32K size per volume.node_id, project_id, project_name, server_id, volume_id, volume_name
lightbox_fe_write_buffer_double_dup_objects_insertedCounterNumber of duplicate objects double inserted.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_dup_objects_insertedCounterNumber of duplicate objects inserted.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_inflight_objectsGaugeNumber of objects inflight.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_max_inflight_objectsGaugeMaximum number of inflight objects.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_nr_write_readyGaugeNumber of objects that are write-ready.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_objects_foundCounterNumber of objects found.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_objects_insertedCounterNumber of objects inserted.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_objects_polledCounterNumber of objects polled.core_id, node_id, server_id, shard
lightbox_fe_write_buffer_objects_removedCounterNumber of objects removed.core_id, node_id, server_id, shard
lightbox_fe_write_handler_flush_due_to_resubmit_cmdCounterNumber of times a flush occurred due to a resubmit of a write command.core_id, node_id, server_id, shard
lightbox_fe_write_handler_resubmit_write_cmdsCounterNumber of resubmitted write commands.core_id, node_id, server_id, shard
lightbox_fe_write_handler_write_cmdsGaugeNumber of inflight write commands.core_id, node_id, server_id, shard
lightbox_fe_write_handler_write_reqsGaugeNumber of inflight write requests.core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_0_1K_avgGaugeWrite requests for 0-1K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_0_1K_countGaugeWrite requests for 0-1K size sample count.core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_0_1K_sumGaugeWrite requests for 0-1K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_1K_8K_avgGaugeWrite requests for 1K-8K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_1K_8K_countGaugeWrite requests for 1K-8K size sample count.core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_1K_8K_sumGaugeWrite requests for 1K-8K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_32K_128K_avgGaugeWrite requests for 32K-128K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_32K_128K_countGaugeWrite requests for 32K-128K size sample count.core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_32K_128K_sumGaugeWrite requests for 32K-128K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_8K_32K_avgGaugeWrite requests for 8K-32K size average duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_8K_32K_countGaugeWrite requests for 8K-32K size sample count.core_id, node_id, server_id, shard
lightbox_fe_write_request_duration_8K_32K_sumGaugeWrite requests for 8K-32K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_0_1K_countGaugeWrite latency to data done for 0-1K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_0_1K_sumGaugeWrite latency to data done for 0-1K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_1K_8K_countGaugeWrite latency to data done for 1K-8K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_1K_8K_sumGaugeWrite latency to data done for 1K-8K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_32K_128K_countGaugeWrite latency to data done for 32K-128K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_32K_128K_sumGaugeWrite latency to data done for 32K-128K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_8K_32K_countGaugeWrite latency to data done for 8K-32K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_data_8K_32K_sumGaugeWrite latency to data done for 8K-32K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_0_1K_countGaugeWrite latency to r2t data 0-1K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_0_1K_sumGaugeWrite latency to r2t data 0-1K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_1k_8K_countGaugeWrite latency to r2t data 1k-8K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_1k_8K_sumGaugeWrite latency to r2t data 1k-8K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_32k_128K_countGaugeWrite latency to r2t data 32k-128K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_32k_128K_sumGaugeWrite latency to r2t data 32k-128K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_8k_32K_countGaugeWrite latency to r2t data 8k-32K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_r2t_8k_32K_sumGaugeWrite latency to r2t data 8k-32K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_0_1K_countGaugeWrite latency to repl for 0-1K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_0_1K_sumGaugeWrite latency to repl for 0-1K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_1K_8K_countGaugeWrite latency to repl for 1K-8K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_1K_8K_sumGaugeWrite latency to repl for 1K-8K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_32K_128K_countGaugeWrite latency to repl for 32K-128K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_32K_128K_sumGaugeWrite latency to repl for 32K-128K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_8K_32K_countGaugeWrite latency to repl for 8K-32K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_8K_32K_sumGaugeWrite latency to repl for 8K-32K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_0_1K_countGaugeWrite latency to repl_ackd for 0-1K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_0_1K_sumGaugeWrite latency to repl_ackd for 0-1K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_1K_8K_countGaugeWrite latency to repl_ackd for 1K-8K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_1K_8K_sumGaugeWrite latency to repl_ackd for 1K-8K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_32K_128K_countGaugeWrite latency to repl_ackd for 32K-128K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_32K_128K_sumGaugeWrite latency to repl_ackd for 32K-128K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_8K_32K_countGaugeWrite latency to repl_ackd for 8K-32K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_ackd_8K_32K_sumGaugeWrite latency to repl_ackd for 8K-32K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_0_1K_countGaugeWrite latency to repl_sent for 0-1K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_0_1K_sumGaugeWrite latency to repl_sent for 0-1K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_1K_8K_countGaugeWrite latency to repl_sent for 1K-8K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_1K_8K_sumGaugeWrite latency to repl_sent for 1K-8K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_32K_128K_countGaugeWrite latency to repl_sent for 32K-128K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_32K_128K_sumGaugeWrite latency to repl_sent for 32K-128K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_8K_32K_countGaugeWrite latency to repl_sent for 8K-32K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_repl_sent_8K_32K_sumGaugeWrite latency to repl_sent for 8K-32K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_0_1K_countGaugeWrite latency to sec_persist for 0-1K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_0_1K_sumGaugeWrite latency to sec_persist for 0-1K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_1K_8K_countGaugeWrite latency to sec_persist for 1K-8K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_1K_8K_sumGaugeWrite latency to sec_persist for 1K-8K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_32K_128K_countGaugeWrite latency to sec_persist for 32K-128K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_32K_128K_sumGaugeWrite latency to sec_persist for 32K-128K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_8K_32K_countGaugeWrite latency to sec_persist for 8K-32K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sec_persist_8K_32K_sumGaugeWrite latency to sec_persist for 8K-32K size total duration (microseconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_0_1K_countGaugeWrite latency to sharded for 0-1K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_0_1K_sumGaugeWrite latency to sharded for 0-1K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_1K_8K_countGaugeWrite latency to sharded for 1K-8K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_1K_8K_sumGaugeWrite latency to sharded for 1K-8K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_32K_128K_countGaugeWrite latency to sharded for 32K-128K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_32K_128K_sumGaugeWrite latency to sharded for 32K-128K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_8K_32K_countGaugeWrite latency to sharded for 8K-32K size count.core_id, node_id, server_id, shard
lightbox_fe_wrlat_sharded_8K_32K_sumGaugeWrite latency to sharded for 8K-32K size total duration (seconds).core_id, node_id, server_id, shard
lightbox_httpd_connections_currentGaugeCurrent number of open connections.node_id,server_id,service,shard
lightbox_httpd_connections_totalCounterTotal number of connections opened.node_id,server_id,service,shard
lightbox_httpd_read_errorsCounterTotal number of errors while reading HTTP requests.node_id,server_id,service,shard
lightbox_httpd_reply_errorsCounterTotal number of errors while replying to HTTP requests.node_id,server_id,service,shard
lightbox_httpd_requests_servedCounterTotal number of HTTP requests served.node_id,server_id,service,shard
lightbox_memory_allocated_memoryGaugeAllocated memory size in bytes.node_id, server_id, shard
lightbox_memory_cross_cpu_free_operationsCounterTotal number of cross-CPU free operations.node_id, server_id, shard
lightbox_memory_free_memoryGaugeFree memory size in bytes.node_id, server_id, shard
lightbox_memory_free_operationsCounterTotal number of free operations.node_id, server_id, shard
lightbox_memory_failedCounterTotal count of failed memory allocations.node_id, server_id, shard
lightbox_memory_malloc_live_objectsGaugeNumber of live objects.node_id, server_id, shard
lightbox_memory_malloc_operationsCounterTotal number of malloc operations.node_id, server_id, shard
lightbox_memory_reclaims_operationsCounterTotal number of reclaim operations.node_id, server_id, shard
lightbox_memory_total_memoryGaugeTotal memory size in bytes.node_id, server_id, shard
lightbox_reactor_abandoned_failed_futuresCounterTotal number of abandoned failed futures and futures destroyed while still containing an exception.node_id, server_id, shard
lightbox_reactor_aio_bytes_readCounterTotal aio-reads bytes.node_id, server_id, shard
lightbox_reactor_aio_bytes_writeCounterTotal aio-writes bytes.node_id, server_id, shard
lightbox_reactor_aio_errorsCounterTotal aio errors.node_id, server_id, shard
lightbox_reactor_aio_outsizesCounterTotal number of aio operations that exceed the IO limit.node_id, server_id, shard
lightbox_reactor_aio_readsCounterTotal aio-reads operations.node_id, server_id, shard
lightbox_reactor_aio_writesCounterTotal aio-writes operations.node_id, server_id, shard
lightbox_reactor_awake_time_ms_totalCounterTotal reactor awake time (wall clock).node_id, server_id, shard
lightbox_reactor_cpp_exceptionsCounterTotal number of C++ exceptions.node_id, server_id, shard
lightbox_reactor_cpu_busy_msCounterTotal CPU busy time in milliseconds.node_id, server_id, shard
lightbox_reactor_cpu_steal_time_msCounterTotal steal time - the time in which something else was running while the reactor was runnable (not sleeping). Because this is in userspace, some time that could be legitimately thought of as steal time is not accounted as such. For example, if we are sleeping and can wake up but the kernel has not woken us up yet.node_id, server_id, shard
lightbox_reactor_cpu_used_time_msCounterTotal reactor thread CPU time (from CLOCK_THREAD_CPUTIME).node_id, server_id, shard
lightbox_reactor_fstream_read_bytes_blockedCounterCounts the number of bytes read from disk that could not be satisfied from read-ahead buffers, and had to block. Indicates short streams, or incorrect read ahead configuration.node_id, server_id, shard
lightbox_reactor_fstream_readsCounterCounts reads from disk file streams. A high rate indicates high disk activity. Contrast with other fstream_read* counters to locate bottlenecks.node_id, server_id, shard
lightbox_reactor_fstream_reads_ahead_bytes_discardedCounterCounts the number of buffered bytes that were read ahead of time and were discarded because they were not needed, wasting disk bandwidth. Indicates over-eager read ahead configuration.node_id, server_id, shard
lightbox_reactor_fstream_reads_aheads_discardedCounterCounts the number of times a buffer that was read ahead of time was discarded because it was not needed, wasting disk bandwidth. Indicates over-eager read ahead configuration.node_id, server_id, shard
lightbox_reactor_fstream_reads_blockedCounterCounts the number of times a disk read could not be satisfied from read-ahead buffers, and had to block. Indicates short streams, or an incorrect read ahead configuration.node_id, server_id, shard
lightbox_reactor_fsyncsCounterTotal number of fsync operations.node_id, server_id, shard
lightbox_reactor_io_threaded_fallbacksCounterTotal number of IO-threaded-fallbacks operations.node_id, server_id, shard
lightbox_reactor_logging_failuresCounterTotal number of logging failures.node_id, server_id, shard
lightbox_reactor_pollsCounterNumber of times pollers were executed.node_id, server_id, shard
lightbox_reactor_sleep_time_ms_totalCounterTotal reactor sleep time (wall clock).node_id, server_id, shard
lightbox_reactor_tasks_pendingGaugeNumber of pending tasks in the queue.node_id, server_id, shard
lightbox_reactor_tasks_processedCounterTotal tasks processed.node_id, server_id, shard
lightbox_reactor_timers_pendingGaugeNumber of tasks in the timer-pending queue.node_id, server_id, shard
lightbox_reactor_utilizationGaugeCPU utilization.node_id, server_id, shard
lightbox_scheduler_queue_lengthGaugeSize of backlog on this queue, in tasks; indicates whether the queue is busy and/or contended.group,node_id,server_id,shard
lightbox_scheduler_runtime_msCounterAccumulated runtime of this task queue; an increment rate of 1000ms per second indicates full utilization.group,node_id,server_id,shard
lightbox_scheduler_sharesGaugeShares allocated to this queue.group,node_id,server_id,shard
lightbox_scheduler_starvetime_msCounterAccumulated starvation time of this task queue; an increment rate of 1000ms per second indicates the scheduler is unwell.group,node_id,server_id,shard
lightbox_scheduler_tasks_processedCounterCount of tasks executing on this queue; indicates together with runtime_ms the length of tasks.group,node_id,server_id,shard
lightbox_scheduler_time_spent_on_task_quota_violations_msCounterTotal time, in milliseconds, spent in violation of the task quota.group,node_id,server_id,shard
lightbox_scheduler_waittime_msCounterAccumulated waittime of this task queue; an increment rate of 1000ms per second indicates that the queue is waiting for something (e.g., IO).group,node_id,server_id,shard
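
The `_sum`/`_count` pairs and the `_ms` scheduler counters above are usually read as deltas between two scrapes rather than as raw values. The following is a minimal Python sketch of that arithmetic, not part of the exporter. It assumes that `_sum` and `_count` accumulate over time (the table lists them as Gauge, so this should be verified against a live system), and the function names are hypothetical.

```python
def average_duration(sum_prev, count_prev, sum_now, count_now):
    """Mean request duration over the interval between two scrapes.

    Assumes the _sum and _count values are cumulative; the result is in
    the same unit as the _sum metric (seconds or microseconds).
    """
    delta_count = count_now - count_prev
    if delta_count <= 0:
        # No new requests in the interval, or the values were reset.
        return None
    return (sum_now - sum_prev) / delta_count


def queue_utilization(runtime_ms_prev, runtime_ms_now, interval_seconds):
    """Fraction of wall-clock time a scheduler queue spent running.

    An increase of 1000 ms per second of wall-clock time corresponds to 1.0
    (full utilization), as described for lightbox_scheduler_runtime_ms.
    """
    return (runtime_ms_now - runtime_ms_prev) / (interval_seconds * 1000.0)
```

For example, two scrapes of `lightbox_scheduler_runtime_ms` taken 15 seconds apart with values 5000 and 12500 give a utilization of 0.5 for that queue.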

meminfo

OS memory usage information, exported from Linux /proc/meminfo.

NameTypeCommentsLabels
lightbox_meminfo_Active_anon_bytesGauge--
lightbox_meminfo_Active_bytesGaugeMemory that has been used more recently, and usually not reclaimed unless absolutely necessary.-
lightbox_meminfo_Active_file_bytesGauge--
lightbox_meminfo_AnonHugePages_bytesGauge--
lightbox_meminfo_AnonPages_bytesGauge--
lightbox_meminfo_Bounce_bytesGauge--
lightbox_meminfo_Buffers_bytesGaugeMemory in buffer cache in bytes.-
lightbox_meminfo_Cached_bytesGaugeMemory in the pagecache (Diskcache) minus SwapCache. Does not include SwapCached.-
lightbox_meminfo_CmaFree_bytesGauge--
lightbox_meminfo_CmaTotal_bytesGauge--
lightbox_meminfo_CommitLimit_bytesGauge--
lightbox_meminfo_Committed_AS_bytesGaugeAn estimate of how much RAM you would need to make a 99.99% guarantee that there is never OOM (out of memory) for this workload. Normally, the kernel will overcommit memory. This means for example that if you do a 1GB malloc, only when you start USING that malloc memory will you get real memory on demand, and only as much as you use. Other cases might include when you mmap a file that is shared only when you write to it, and you get a private copy of that data. While it normally is shared between processes, the Committed_AS is an estimate of how much RAM/swap you would need in the worst case.-
lightbox_meminfo_DirectMap1G_bytesGauge--
lightbox_meminfo_DirectMap2M_bytesGauge--
lightbox_meminfo_DirectMap4k_bytesGauge--
lightbox_meminfo_Dirty_bytesGaugeMemory that is waiting to get written back to the disk.-
lightbox_meminfo_HardwareCorrupted_bytesGauge--
lightbox_meminfo_HugePages_FreeGauge--
lightbox_meminfo_HugePages_RsvdGauge--
lightbox_meminfo_HugePages_SurpGauge--
lightbox_meminfo_HugePages_TotalGauge--
lightbox_meminfo_Hugepagesize_bytesGauge--
lightbox_meminfo_Inactive_anon_bytesGauge--
lightbox_meminfo_Inactive_bytesGaugeThe total amount of buffer or page cache memory, in bytes, that is free and available. This is memory that has not been recently used and can be reclaimed for other purposes.-
lightbox_meminfo_Inactive_file_bytesGauge--
lightbox_meminfo_KernelStack_bytesGauge--
lightbox_meminfo_Mapped_bytesGaugeFiles that have been mapped, such as libraries.-
lightbox_meminfo_MemAvailable_bytesGauge--
lightbox_meminfo_MemFree_bytesGaugeThe sum of LowFree + HighFree (overall stat).-
lightbox_meminfo_MemTotal_bytesGaugeTotal usable RAM (i.e., physical RAM minus a few reserved bits and the kernel binary code).-
lightbox_meminfo_Mlocked_bytesGauge--
lightbox_meminfo_NFS_Unstable_bytesGauge--
lightbox_meminfo_PageTables_bytesGaugeAmount of memory dedicated to the lowest level of page tables.-
lightbox_meminfo_SReclaimable_bytesGauge--
lightbox_meminfo_SUnreclaim_bytesGauge--
lightbox_meminfo_ShmemHugePages_bytesGauge--
lightbox_meminfo_ShmemPmdMapped_bytesGauge--
lightbox_meminfo_Shmem_bytesGauge--
lightbox_meminfo_Slab_bytesGaugeIn-kernel data structures cache.-
lightbox_meminfo_SwapCached_bytesGaugeMemory that was once swapped out, is swapped back in, but still also is in the swapfile (if memory is needed it does not need to be swapped out again because it is already in the swapfile. This saves I/O).-
lightbox_meminfo_SwapFree_bytesGaugeTotal amount of swap memory free. This is memory that has been evicted from RAM, and is temporarily on the disk.-
lightbox_meminfo_SwapTotal_bytesGaugeTotal amount of physical swap memory.-
lightbox_meminfo_Unevictable_bytesGauge--
lightbox_meminfo_VmallocChunk_bytesGaugeLargest contiguous block of vmalloc area that is free.-
lightbox_meminfo_VmallocTotal_bytesGaugeTotal size of vmalloc memory area.-
lightbox_meminfo_VmallocUsed_bytesGaugeAmount of vmalloc area used.-
lightbox_meminfo_WritebackTmp_bytesGauge--
lightbox_meminfo_Writeback_bytesGaugeMemory that is actively being written back to the disk.-
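
The metric names in this table follow the field names in /proc/meminfo. The Python sketch below shows one way such a mapping could work; it is an illustration under assumptions, not the exporter's actual implementation. It assumes that values reported in kB are converted to bytes and given a `_bytes` suffix, and that characters such as parentheses are replaced with underscores. The function name `parse_meminfo` is hypothetical.

```python
import re


def parse_meminfo(text):
    """Map /proc/meminfo lines to lightbox_meminfo_* style metric names."""
    metrics = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 2:
            continue
        key = parts[0].rstrip(":")
        # "Active(anon)" becomes "Active_anon" (assumed naming rule).
        key = re.sub(r"[^A-Za-z0-9]+", "_", key).strip("_")
        value = float(parts[1])
        if len(parts) == 3 and parts[2] == "kB":
            # /proc/meminfo reports kB as units of 1024 bytes.
            metrics[f"lightbox_meminfo_{key}_bytes"] = value * 1024
        else:
            # Unitless fields such as HugePages_Total are plain counts.
            metrics[f"lightbox_meminfo_{key}"] = value
    return metrics
```

On a Linux host, `parse_meminfo(open("/proc/meminfo").read())` returns a dictionary keyed by names in the same style as the table above.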

netdev

Network device information, fetched from Linux /proc/net/dev.

NameTypeCommentsLabels
lightbox_netdev_receive_bytes_totalGaugeTotal received bytes.device
lightbox_netdev_receive_compressed_totalGauge-device
lightbox_netdev_receive_drop_totalGaugeTotal number of packets dropped by the network device.device
lightbox_netdev_receive_errs_totalGaugeTotal number of receive errors detected by the network device.device
lightbox_netdev_receive_fifo_totalGauge-device
lightbox_netdev_receive_frame_totalGauge-device
lightbox_netdev_receive_multicast_totalGauge-device
lightbox_netdev_receive_packets_totalGaugeTotal received packets.device
lightbox_netdev_transmit_bytes_totalGaugeTotal transmitted bytes.device
lightbox_netdev_transmit_compressed_totalGauge-device
lightbox_netdev_transmit_drop_totalGauge-device
lightbox_netdev_transmit_errs_totalGauge-device
lightbox_netdev_transmit_fifo_totalGauge-device
lightbox_netdev_transmit_frame_totalGauge-device
lightbox_netdev_transmit_multicast_totalGauge-device
lightbox_netdev_transmit_packets_totalGaugeTotal transmitted packets.device

netstat

Network statistics, fetched from Linux /proc/net/netstat.

NameTypeCommentsLabels
lightbox_netstat_Icmp6_InErrorsGauge--
lightbox_netstat_Icmp6_InMsgsGauge--
lightbox_netstat_Icmp6_OutMsgsGauge--
lightbox_netstat_Icmp_InErrorsGauge--
lightbox_netstat_Icmp_InMsgsGauge--
lightbox_netstat_Icmp_OutMsgsGauge--
lightbox_netstat_Ip6_InOctetsGauge--
lightbox_netstat_Ip6_OutOctetsGauge--
lightbox_netstat_IpExt_InOctetsGauge--
lightbox_netstat_IpExt_OutOctetsGauge--
lightbox_netstat_Ip_ForwardingGauge--
lightbox_netstat_TcpExt_ListenDropsGauge--
lightbox_netstat_TcpExt_ListenOverflowsGauge--
lightbox_netstat_TcpExt_SyncookiesFailedGauge--
lightbox_netstat_TcpExt_SyncookiesRecvGauge--
lightbox_netstat_TcpExt_SyncookiesSentGauge--
lightbox_netstat_TcpExt_TCPSynRetransGauge--
lightbox_netstat_Tcp_ActiveOpensGauge--
lightbox_netstat_Tcp_CurrEstabGauge--
lightbox_netstat_Tcp_InErrsGauge--
lightbox_netstat_Tcp_InSegsGauge--
lightbox_netstat_Tcp_OutSegsGauge--
lightbox_netstat_Tcp_PassiveOpensGauge--
lightbox_netstat_Tcp_RetransSegsGauge--
lightbox_netstat_Udp6_InDatagramsGauge--
lightbox_netstat_Udp6_InErrorsGauge--
lightbox_netstat_Udp6_NoPortsGauge--
lightbox_netstat_Udp6_OutDatagramsGauge--
lightbox_netstat_UdpLite6_InErrorsGauge--
lightbox_netstat_UdpLite_InErrorsGauge--
lightbox_netstat_Udp_InDatagramsGauge--
lightbox_netstat_Udp_InErrorsGauge--
lightbox_netstat_Udp_NoPortsGauge--
lightbox_netstat_Udp_OutDatagramsGauge--

smart

Exposes disk SMART statistics. For more information, see the Linux documentation for the smartctl command.

NameTypeCommentsLabels
lightbox_smart_available_spare_percentGaugeAvailable spare [percent].disk, type
lightbox_smart_available_spare_threshold_percentGaugeAvailable spare threshold [percent].disk, type
lightbox_smart_controller_busy_timeCounterController busy time.disk, type
lightbox_smart_critical_comp_temperature_timeCounterCritical composite temperature time.disk, type
lightbox_smart_critical_warningGaugeNumber of critical warnings.disk, type
lightbox_smart_data_units_readCounterNumber of data units read.disk, type
lightbox_smart_data_units_writtenCounterNumber of data units written.disk, type
lightbox_smart_error_information_log_entriesCounterNumber of error information log entries.disk, type
lightbox_smart_host_read_commandsCounterNumber of host read commands.disk, type
lightbox_smart_host_write_commandsCounterNumber of host write commands.disk, type
lightbox_smart_media_and_data_integrity_errorsCounterNumber of media and data integrity errors.disk, type
lightbox_smart_percentage_usedGaugePercentage used.disk, type
lightbox_smart_power_cyclesCounterNumber of power cycles.disk, type
lightbox_smart_power_on_hoursCounterNumber of hours powered on.disk, type
lightbox_smart_temperature_celsiusGaugeTemperature [celsius].disk, type
CounterNumber of host read commands.disk, type
lightbox_smart_unsafe_shutdownsCounterNumber of unsafe shutdowns.disk, type
lightbox_smart_warning_comp_temperature_timeCounterWarning composite temperature time.disk, type
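
The data unit counters above are not byte counts. In the NVMe SMART / Health log, one data unit corresponds to 1000 units of 512 bytes. The Python sketch below converts a data unit count to bytes. It assumes that `lightbox_smart_data_units_read` and `lightbox_smart_data_units_written` carry the raw NVMe value unchanged, which should be confirmed for the deployed exporter; the function name is hypothetical.

```python
# One NVMe data unit is 1000 blocks of 512 bytes (per the NVMe SMART log definition).
NVME_DATA_UNIT_BYTES = 1000 * 512


def data_units_to_bytes(data_units):
    """Convert an NVMe data unit count to bytes.

    Assumes the metric value is the raw NVMe "data units" figure.
    """
    return data_units * NVME_DATA_UNIT_BYTES
```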

systemd

Collects information on Lightbits services. Metrics expose information such as service state and usage.

NameTypeCommentsLabels
lightbox_systemd_service_restart_totalGaugeService unit count of Restart triggers.device
lightbox_systemd_socket_accepted_connections_totalGaugeTotal number of accepted socket connections.device
lightbox_systemd_socket_current_connectionsGaugeCurrent number of accepted socket connections.device
lightbox_systemd_socket_refused_connections_totalGaugeTotal number of refused socket connections.device
lightbox_systemd_system_runningGaugeWhether the system is operational (see 'systemctl is-system-running').device
lightbox_systemd_timer_last_trigger_secondsGaugeSeconds since epoch of last trigger.device
lightbox_systemd_unit_start_time_secondsGaugeStart time of the unit since unix epoch in seconds.device
lightbox_systemd_unit_stateGaugeSystemd unit.device
lightbox_systemd_unit_tasks_currentGaugeCurrent number of tasks per Systemd unit.device
lightbox_systemd_unit_tasks_maxGaugeMaximum number of tasks per Systemd unit.device
lightbox_systemd_unitsGaugeSummary of Systemd unit states.device
lightbox_systemd_versionGaugeDetected Systemd version.device