The architecture and capabilities of HiveMQ are fundamentally derived from its comprehensive and robust implementation of the Message Queuing Telemetry Transport (MQTT) protocol. Understanding MQTT is essential to understanding HiveMQ's design philosophy and its suitability for specific domains.
II.I - The MQTT Protocol as a Foundation (ISO/IEC 20922)MQTT is an open, lightweight, publish-subscribe network protocol standardised by OASIS and ISO, designed for communication in constrained environments where network bandwidth is at a premium.
1 Its event-driven nature decouples data producers from consumers, creating a highly scalable foundation for distributed systems.
2II.I.I - Inherent StatefulnessA defining characteristic of the MQTT protocol is its built-in statefulness, which is critical for managing clients on unreliable or intermittent networks, a common scenario in IoT and IIoT. This is not a feature added by a broker but is a core part of the protocol specification that brokers like HiveMQ implement.
Quality of Service (QoS)MQTT defines three distinct levels of delivery guarantee, allowing developers to balance reliability with performance overhead.
4- QoS 0 (At most once): A "fire-and-forget" level where messages are sent without confirmation. It offers the lowest latency but no delivery guarantee, suitable for high-frequency sensor data where occasional loss is acceptable.1
- QoS 1 (At least once): Guarantees that a message will be delivered at least one time. The sender stores the message until it receives a PUBACK packet from the receiver. This can result in duplicate messages if the acknowledgement is lost, making it suitable for commands where duplication is tolerable but loss is not.1
- QoS 2 (Exactly once): The highest level of reliability, ensuring a message is delivered exactly once using a four-part handshake (PUBREC, PUBREL, PUBCOMP). This is essential for critical operations like financial transactions or remote control actions where neither loss nor duplication can be tolerated.1
Persistent Sessions & Queued MessagesWhen a client connects with the cleanSession flag set to false, the broker maintains the client's session state, including its subscriptions and any QoS 1 or QoS 2 messages that were published while the client was disconnected. Upon reconnection, the broker delivers these queued messages, ensuring data is not lost due to network instability.
1Retained Messages An MQTT publisher can set a RETAIN flag on a message. The broker then stores this message for that specific topic. When a new client subscribes to that topic, the broker immediately sends the last retained message. This provides new clients with the most recent "last known good" state of a device or sensor without having to wait for the next update, enabling instant state awareness.
4Last Will and Testament (LWT)A client can register an LWT message with the broker upon connecting. If the client disconnects ungracefully (e.g., due to power loss or network failure), the broker automatically publishes the LWT message to a specified topic. This allows all interested systems to be immediately notified of the client's offline status, enabling real-time session awareness across the entire system.
4II.II - HiveMQ Broker Architecture: Scalability and ResilienceHiveMQ is engineered as an enterprise MQTT platform designed for performance, high availability, and massive scalability.
Distributed Masterless ClusterHiveMQ employs a distributed, masterless cluster architecture. All nodes in the cluster are equal, which eliminates any single point of failure. This design provides true horizontal scalability; as more nodes are added to the cluster, the capacity to handle concurrent connections and message throughput increases linearly. A single HiveMQ node can handle hundreds of thousands of connections, and clusters have been benchmarked to support 10 million concurrent clients.
3Cluster Discovery MechanismsTo form a cluster, individual HiveMQ nodes must be able to find each other. HiveMQ supports several discovery mechanisms to accommodate different deployment environments.8
Static DiscoveryThe IP addresses and ports of all cluster nodes are explicitly listed in the configuration file. This is ideal for fixed-size deployments where high availability is the primary concern.
8Dynamic Discovery Mechanisms like UDP multicast and broadcast allow nodes to discover each other automatically at startup. This is useful for creating elastic clusters that can be scaled up or down easily. However, these mechanisms are often not supported in public cloud environments like AWS, which do not permit IP multicast traffic.
8Monitoring and ObservabilityAs a complex distributed system, robust monitoring is critical. HiveMQ exposes a wide range of metrics that can be scraped by monitoring solutions like Prometheus and visualised in dashboards like Grafana. This allows operators to track key performance indicators such as cluster and node health, CPU and memory (heap) consumption, incoming connection rates, message throughput, and the status of the built-in overload protection mechanism.
10II.III - HiveMQ at the Edge: Bridging OT and ITA significant part of HiveMQ's value proposition is its focus on solving edge computing challenges, particularly in the industrial sector.
- HiveMQ Edge as a Gateway: HiveMQ Edge is a software-based MQTT gateway designed to run on-premise, close to the source of data. Its primary function is to act as a protocol converter, translating diverse industrial protocols such as Modbus, OPC-UA, and Siemens S7 into the standardised MQTT format.12 This capability is crucial for modernising legacy industrial infrastructure and breaking down data silos between OT and IT systems.
- Enabling the Unified Namespace (UNS): HiveMQ is positioned as a foundational technology for implementing a Unified Namespace (UNS) architecture.13 A UNS is an event-driven messaging infrastructure that acts as a central data hub—a single source of truth—for all data within a manufacturing enterprise.6 HiveMQ's native support for MQTT's hierarchical topic structure, stateful features, and the Sparkplug B specification (which defines a standard topic namespace and payload format for industrial data) makes it an ideal choice for building a UNS.7
- Edge-Specific Features: HiveMQ Edge includes features tailored for the edge environment, such as offline buffering, which stores data locally on disk during network connectivity disruptions and forwards it once the connection to the central broker is re-established, ensuring zero data loss. It also features a Data Hub, an integrated policy and transformation engine that can validate, filter, and manipulate data in motion at the edge before it is sent to the cloud, reducing bandwidth costs and enabling real-time edge intelligence.12
The capabilities of HiveMQ are not merely a collection of features but are deeply rooted in its faithful and robust implementation of the MQTT standard. This makes it exceptionally well-suited for environments where client state, network reliability, and interoperability with a wide range of devices are the primary challenges. Its ecosystem, particularly HiveMQ Edge and its alignment with IIoT standards like Sparkplug, provides a significant advantage in the industrial space where connecting disparate OT equipment is a core problem.