The GNUnet communicators

Internet-Draft	The GNUnet communicators	July 2025
Schanzenbach, et al.	Expires 1 February 2026	[Page]

Abstract

This document contains the GNUnet communicator specification.¶

This document defines the normative wire format of communicator protocols, cryptographic routines and security considerations for use by implementers.¶

This specification was developed outside the IETF and does not have IETF consensus. It is published here to inform readers about the function of GNUnet communicators, guide future communicator implementations, and ensure interoperability among implementations including with the pre-existing GNUnet implementation.¶

6. UDP communicator

The UDP communicator implements an encryption layer that protects both the payload and the communicator's specific metadata (not to be confused with the UDP header). In particular, any message sent by the communicator is indistinguishable from random payload to an outside observer, with the exception of broadcast messages.¶

For any new connection to a target peer, the communicator attempts to establish a shared secret using the KEM defined in Section 5.1. The communicator initiating the connection sends the resulting Elligator representative, the authentication tag and the encrypted data to the receiving peer. Since the GCM authentication tag and the encrypted data in the key exchange message also appear random, the entire message is computationally indistinguishable from a random byte stream. Independent of the payload, each message includes the sender's peer identity, a monotonic timestamp, and a signature over the session metadata. Receivers MUST keep track of the monotonic timestamps of key exchanges with each peer to prevent replay attacks. For each subsequent message the same procedure is used with a new encapsulation. While the communicator may always fall back to this type of encryption, it is inefficient for high-volume data transfer because a new key exchange is required for every message. However, sometimes we may have no other choice, for example, if there is only bi-directional connectivity to the receiving peer.¶

If the target peer is able to acknowledge the reception of a message, the employed key can be reused. Such acknowledgments can be sent either via a bi-directional UDP connection or a backchannel connection provided by TRANSPORT. This acknowledgment prompts the communicator to offer a new queue to TRANSPORT, which has a higher priority than the default queue but starts with limited capacity. The capacity increases whenever the communicator receives an acknowledgment for a transmission. This queue is suitable for high-volume data transfer, and TRANSPORT will prioritize it if available.¶

There are three distinct message types that are sent and received by UDP communicators: KX, BOX and BROADCAST. For KX and BOX messages, their metadata is chosen such that they are indistinguishable from random. This property does not hold for BROADCAST messages and is not necessary, as they are only sent within a local area network.¶

6.1. Key exchange

Independent of the type of message queue, a key exchange is initiated at least once by the sending peer. In cases where the receiving peer cannot acknowledge the reception of messages, a key exchange is performed for every message. Two key pairs are needed for the KEM: An ephemeral key pair generated as part of the encapsulation procedure Encap(pkR) -> (MSK,enc) and the peer identity of the receiving communciator. The algorithm in use for the KEM is DHKEM(X25519+Elligator, HKDF-SHA256) [LSD0011]. The peer identity of the receiving communicator is an Ed25519 public key pkRed. In order to use it compliantly with a X25519-based DHKEM as defined in [LSD0011] and [RFC9180], the curve point must first be converted from Edwards into its birationally equivalent Montgomery form pkR. The encapsulation enc is transferred via a key exchange (KX) message, as defined in Figure 1.¶

0           8           16          24
+-----+-----+-----+-----+-----+-----+-----+-----+
|                    ENC                        |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                    GCM TAG                    |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
/                 ENCRYPTED DATA                /
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 1: The binary representation of the KX message.

ENC: The 256-bit serialized encapsulation result enc of the KEM.¶
GCM TAG: The 128-bit GCM tag is used to authenticate the ciphertext immediately following the header part of the KX message.¶
ENCRYPTED DATA: The remaining data (as indicated by SIZE) is AES-GCM encrypted using the current session key and authenticated through the GCM TAG.¶

In order to prevent replay attacks for KX messages, the plaintext resulting from decryption of the encrypted data MUST must start with a session-specific Confirmation header as defined in Figure 2. It includes the sender's peer identity and a monotonic timestamp, which the receiving peer MUST keep track of each peer identity to reject possible replay attacks.¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|                                               |
|                 SENDER PEER ID                |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                                               |
|                 SIGNATURE                     |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
/                PAYLOAD                        /
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 2: The binary representation of the KX Confirmation header

SENDER PEER ID: A 256-bit EdDSA public key (pkSed).¶
SIGNATURE: The EdDSA signature is computed with the peer private key over the session metadata, as detailed in Figure 3.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶
PAYLOAD: The message payload data.¶

The confirmation header also includes a signature over the session's metadata, which is signed with the corresponding private key of the sender's peer identity. The data format over which the signature is computed is defined in Figure 3 ¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |       PURPOSE (0x0X)  |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SENDER PEER ID                |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 RECEIVER PEER ID              |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 ENC                           |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 3: The wire format of the data structure over which the signature of the UDP Confirmation header is computed.

SIZE: A 32-bit value containing the length of the signed data in bytes in network byte order.¶
PURPOSE: A 32-bit signature purpose flag in network byte order. The value of this field MUST be 33. It defines the context in which the signature is created so that it cannot be reused in other parts of the protocol, including possible future extensions. The value of this field corresponds to an entry in the GANA "GNUnet Signature Purpose" registry [GANA].¶
SENDER PEER ID: A 256-bit EdDSA public key (pkSed).¶
RECEIVER PEER ID: A 256-bit EdDSA public key (pkRed).¶
ENC: The 256-bit serialized encapsulation result enc of the KEM.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶

Upon receiving a KX message, the receiving peer decapsulates the secret key MSK using MSK <- Decap(skR,enc), where skR is the X25519 private key derived from its Ed25519 counterpart skRed. This Decap(skR, enc) procedure is defined in [LSD0011]. Note that the exchange of the receiver peer identity is not within the scope of the UDP communicator's key exchange and is already assumed to be known to the sending peer. One way to exchange peer identities is through the UDP BROADCAST messages as described in Section 6.5.¶

The MSK is then used together with a sequence number SEQ to derive symmetric encryption key K and initialization vector IV using the "SetupCipher" procedure outlined below. Both the sending and the receiving peer SHOULD store the master shared secret MSK and attribute it to the corresponding peer. In case of an acknowledgment from the receiving peer, the established MSK can be reused by iteratively increasing the sequence number SEQ for SetupCipher(MSK, SEQ).¶

Additional data might be inserted after the confirmation header as part of the encrypted data of the KX message. Padding may be necessary due to the use of AES-GCM. Once a KX message is received and validated, the peer SHOULD try to acknowledge the established MSK to switch to a stable session. The details about the acknowledgment process and subsequent message exchange can be found in Section 6.3.¶

6.2. Key schedule

Once a shared secret MSK is established through the Elligator KEM, a symmetric key and initialization vector are derived. According to a key schedule from a 32-bit sequence number SEQ (in network byte order) and the MSK. The initial value of the sequence number is 0.¶

SetupCipher(MSK,SEQ):
  K := HKDF-Expand (MSK, "gnunet-communicator-udp-key"||SEQ, 32)
  IV := HKDF-Expand (MSK, "gnunet-communicator-udp-iv"||SEQ, 12)
  return K,IV

SetupCipher returns a 256-bit AES key "K" as well as a 96-bit "IV" for use in AES-GCM.¶

Each derived key is uniquely identified using a separately derived 256-bit key ID (KID) derived in a similar fashion:¶

DeriveKID(MSK,SEQ):
  KID := HKDF-Expand (MSK, "gnunet-communicator-udp-kid"||SEQ, 32)
  return KID

The sequence number SEQ for any shared secret is initially 0 and incremented on the sender side for each successive encryption and on the receiver side for each decryption.¶

6.3. Message exchange

KX messages, as presented in Section 6.1, are sufficient for transferring arbitrary amounts of data. This way of communicating is slow due to the establishment of a shared secret for each message using asymmetric cryptography. The UDP communicator offers a faster way of communicating by reusing a shared secret. For this purpose, the receiver of a message SHOULD acknowledge the reception to signal the sender that the same shared secret can be reused. The sender can then use the acknowledged shared secret and increment the utilized sequence number for each subsequent message to derive new symmetric key material. These messages are sent as BOX messages, which incorporate a KID as defined in Section 6.2 to identify both the master shared secret and sequence number. The wire format of a BOX message is depicted in Figure 4.¶

0           8           16          24
+-----+-----+-----+-----+-----+-----+-----+-----+
|                     KEY ID                    |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                    GCM TAG                    |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
/                 ENCRYPTED DATA                /
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 4: The binary representation of the UDP Box message.

KEY ID: A 256-bit value containing the KID of the symmetric key to use for decryption as derived using DeriveKID as described in Section 6.2.¶
GCM TAG: A 128-bit GCM tag used to authenticate the ciphertext immediately following this TCP Box header.¶
ENCRYPTED DATA: The remaining data (as indicated by SIZE) is AES-GCM encrypted using the derived key and IV identified by the KID.¶

An acknowledgment can be sent in various ways is ultimately decided by TRANSPORT. If the target peer can also reach the sending peer via UDP messages, both KX messages or BOX messages could be utilized to send the acknowledgment as their payload. TRANSPORT could also choose to utilize another communicator type to send the acknowledgment (backchannel). Either way, acknowledgments are always sent in form of an ACK header. The wire format of the ACK header can be seen in Figure 5.¶

0                       16
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |       TYPE (0x0X)     |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SEQ ACK                       |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MSK HASH (fromerly CMAC)       |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 5: The wire format of an ACK header.

SIZE: A 16-bit value containing the length of the message in bytes in network byte order.¶
TYPE: A 16-bit signature type flag in network byte order. The value of this field MUST be 1460.¶
SEQ ACK: Sequence acknowledgment limit. Specifies the current maximum sequence number supported by the receiver.¶
MSK HASH: CMAC of the base key being acknowledged.¶

To avoid having to acknowledge every single message individually, the sender of an acknowledgment can specify the allowed sequence number for the sender in the ACK header. The receiver MUST precalculate all derived keys and corresponding KIDs for which it has already sent ACKs. Consequently, for valid sequence numbers below the current ACK limit, KID should match one of the precalculated keys in the key cache, and the encrypted data can be decrypted. Otherwise, the message MUST be rejected.¶

Multiple shared secrets can be used simultaneously between the sending peer and target peer. Should the sending peer use up all acknowledgments for all its shared secrets, messages are sent through KX messages again.¶

6.4. Rekeying

The amount of data that can be encrypted with a shared secret MUST be limited. Before the capacity of a shared secret is used up, the sender initiates rekeying by sending a new ephemeral public key for a key exchange. As multiple shared secrets can be used simultaneously, rekeying doesn't necessarily delete the old shared secret if its capacity is not yet reached. The ephemeral public key is sent encrypted in a Rekey header as part of the payload of BOX message. Because the ephemeral public key is encrypted, there is no need to use Elligator's encoding function and we use the normal, unobfuscated DHKEM(X25519, HKDF-SHA256). The wire format of the Rekey header can be seen in Figure 6.¶

0                       16
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |       TYPE (0x0X)     |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                     ENC                       |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 6: The wire format of a Rekey header.

SIZE: A 16-bit value containing the length of the message in bytes in network byte order.¶
TYPE: A 16-bit signature type flag in network byte order. The value of this field MUST be 1462.¶
ENC: The 256-bit serialized encapsulation result enc of the KEM.¶

Additional data might be inserted after the Rekey header as part of the encrypted data of the BOX message. Padding may be necessary due to the use of AES-GCM.¶

6.5. Broadcast

BROADCAST messages are sent by peers to announce their presence. Those messages are only distributed in the LAN and sent in cleartext.¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|                                               |
|                 SENDER PEER ID                |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                                               |
|                 SIGNATURE                     |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 7: The binary representation of the udp BROADCAST message

SENDER PEER ID: A 256-bit EdDSA public key (pkSed).¶
SIGNATURE: The EdDSA signature is computed with the announced peer private key over the peer identity and address hash, as depicted in Figure 8.¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |       PURPOSE (0x0X)  |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SENDER PEER ID                |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 ADDRESS HASH                  |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 8: The wire format of the data structure over which the signature of the UDP BROADCAST message is computed.

SIZE: A 32-bit value containing the length of the signed data in bytes in network byte order.¶
PURPOSE: A 32-bit signature purpose flag in network byte order. The value of this field MUST be 34. It defines the context in which the signature is created so that it cannot be reused in other parts of the protocol, including possible future extensions. The value of this field corresponds to an entry in the GANA "GNUnet Signature Purpose" registry [GANA].¶
SENDER PEER ID: A 256-bit EdDSA public key (pkSed).¶
ADDRESS HASH: Hash of the sender's UDP address.¶

7. TCP communicator

TCP communicators always establish an encrypted and bi-directional communication channel. For each direction of communication, a dedicated shared secret is used to both encrypt and authenticate messages. These shared secrets are exchanged during the initial handshake. After a certain amount of data has been transmitted, rekeying occurs to renew the key material. Note that the rekeying process is triggered individually for each communication direction.¶

To achieve a zero-plaintext design, we MUST use the mac-then-encrypt approach to hide the message size on the wire. Extra caution needs to be taken due to the vulnerability of the mac-then-encrypt design to padding oracle attacks. To mitigate this issue, the TCP communicator uses AES-CTR for encryption, which does not require padding. Additionally, the use of ephemeral keys combined with monotonic timestamps limits an attacker's ability to exploit the oracle, as replay attacks are prevented.¶

7.1. Handshake

The main purpose of the handshake is to establish shared key material for each direction of the communication channel. The initiating TCP Communicator starts the handshake by sending an encapsulation from the Elligator KEM defined in Section 5.1.¶

The encapsulation MUST be directly followed by an encrypted TCP handshake message, as shown in Figure 9. In addition to the peer identity of the sender and a timestamp, it contains a nonce as a challenge for the receiving TCP communicator. All data is authenticated with a signature.¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SENDER PEER ID                |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SIGNATURE                     |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                     NONCE                     |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 9: The binary representation of the TCP handshake message.

SENDER PEER ID: A 256-bit EdDSA public key (pkSed).¶
SIGNATURE: A 512-bit EdDSA signature. The signature is calculated over the data as defined in Figure 10.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶
NONCE: A 256-bit random value used as a challenge to be signed in a TCP handshake acknowledgment message.¶

The data scheme used for computing the signature is depicted in Figure 10.¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |       PURPOSE (0x0X)  |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SENDER PEER ID                |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 RECEIVER PEER ID              |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                REPRESENTATIVE                 |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                     NONCE                     |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 10: The wire format used for creating the signature of the tcp handshake message.

SIZE: A 32-bit value containing the length of the signed data in bytes in network byte order.¶
PURPOSE: A 32-bit signature purpose flag in network byte order. The value of this field MUST be 31. It defines the context in which the signature is created so that it cannot be reused in other parts of the protocol, including possible future extensions. The value of this field corresponds to an entry in the GANA "GNUnet Signature Purpose" registry [GANA].¶
SENDER PEER ID: A 256-bit EdDSA public key.¶
RECEIVER PEER ID: A 256-bit EdDSA public key.¶
REPRESENTATIVE: The 256-bit serialized encapsulation result of the KEM.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶
NONCE: A 256-bit random value.¶

Upon reception of the ephemeral public key, the receiving TCP communicator carries out the decapsulation step of the key exchange and retrieves the shared key material. The subsequently received TCP handshake message is then decrypted and verified. If the signature is invalid, the connection is dropped. In the case of a valid signature, the receiving TCP communicator sends its own TCP handshake message to establish shared key material for outgoing messages and also replies with an encrypted TCP handshake acknowledgment message as defined in Figure 11.¶

0           8           16          24
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |        TYPE (0x0X)    |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SENDER PEER ID                |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 RECEIVER PEER ID              |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SIGNATURE                     |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                     NONCE                     |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 11: The binary representation of the tcp handshake acknowledgment message.

SIZE: A 16-bit value containing the length of the message in bytes in network byte order.¶
TYPE: A 16-bit signature type flag in network byte order. The value of this field MUST be 1453.¶
SENDER PEER ID: A 256-bit EdDSA public key.¶
RECEIVER PEER ID: A 256-bit EdDSA public key.¶
Signature: A 512-bit EdDSA signature. The signature is calculated over the data as defined in Figure 12.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶
NONCE: A 256-bit random value.¶

The data scheme used for computing the signature for the acknowledgment message is depicted in Figure 12.¶

0           8           16          24
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |        TYPE (0x0X)    |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SENDER PEER ID                |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 RECEIVER PEER ID              |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                     NONCE                     |
|                                               |
/                                               /
/                                               /
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 12: The wire format used for creating the signature of the tcp handshake acknowledgment message.

SIZE: A 16-bit value containing the length of the message in bytes in network byte order.¶
TYPE: A 16-bit signature type flag in network byte order. The value of this field MUST be 39.¶
SENDER PEER ID: A 256-bit EdDSA public key.¶
RECEIVER PEER ID: A 256-bit EdDSA public key.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶
NONCE: A 256-bit random value.¶

The initiating TCP communicator also replies with a TCP handshake acknowledgment message after receiving a valid TCP handshake message. Lastly, each party verifies both the signature and the challenge within the received TCP handshake acknowledgment message, thus completing the handshake.¶

7.2. Key exchange

During the initial handshake, each communication channel performs the Elligator KEM defined in Section 5.1. The resulting shared secret us used in an AES-CTR encryption with HMAC-SHA512. Subsequent key exchanges are performed with each rekeying. More about the initial handshake and rekeying can be found in Section 7.1 and Section 7.4, respectively.¶

Let (REC_SK, REC_ID) be the receiver peer's EdDSA key pair. The sender peer initiates the key exchange using the Elligator KEM from Section 5.1 resulting in an encapsulation and initial master secret key MSK. MSK is used to derive a symmetric encryption and HMAC key as well as an initialization vector using the procedure "SetupCipher":¶

SetupCipher(REC_ID, MSK):
  K := HKDF-Expand (MSK, "gnunet-communicator-tcp-key", 32)
  IV := HKDF-Expand (MSK, "gnunet-communicator-tcp-ctr, 16)
  K_mac := HKDF-Expand (MSK, "gnunet-communicator-tcp-hmac, 64)
  return K,IV,K_mac

Note that the initiating TCP communicator can immediately encrypt the first TCP handshake message when sending it. As soon as the receiving TCP communicator receives and decapsulates the representative, it can decrypt the following TCP handshake message. The same applies to the TCP handshake message sent by the receiving TCP communicator.¶

7.3. Message exchange

Once the handshake is completed, actual payloads can be exchanged bi-directionally using TCP BOX messages. A TCP Box message consists of a TCP BOX message, as defined in Figure 13, followed by the payload. Both parts are encrypted before being sent to the receiving peer.¶

TCP Box messages follow the mac-then-encrypt approach to hide the size of the payload and achieve a zero-plaintext design. The HMAC utilizes SHA512 as the underlying hash function and is ratcheted after each operation. Given the mac-then-encrypt approach, additional safeguards are needed to protect against Oracle padding attacks. Therefore, we MUST use a padding-free encryption scheme such as AES-CTR for encryption. Additionally, we restrict the attacker's ability to replay attacks by exchanging new key material after a randomly chosen amount of transferred data, as described in Section 7.4. The necessary key exchanges to establish the new key material are protected using monotonic timestamps.¶

0           8           16          24
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |        TYPE (0x0X)    |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                    HASHCODE                   |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 13: The binary representation of the TCP BOX message.

SIZE: A 16-bit value containing the length of the message in bytes in network byte order.¶
TYPE: A 16-bit signature type flag in network byte order. The value of this field MUST be 1451.¶
HASHCODE: A 256-bit HMAC-SHA512 hashcode for the subsequently sent payload.¶

7.4. Rekeying

After each key exchange, up to 400 MB of data is transferred until rekeying is triggered by the sender of the communication direction. The actual amount of transferred data SHOULD be chosen randomly. If the chosen byte quantity is not reached after one day, rekeying is set off anyway.¶

The receiving communicator is signaled about a rekeying through the dispatch of a TCP Rekey message, as defined in Figure 14. The message MUST be encrypted with the current key. Due to the encryption of the message, the encoding of the new ephemeral public key with Elligator is not needed. Similarly to the initial handshake, the ephemeral public key is used to perform a key exchange from which new key material for the encryption and authentication code scheme is derived. For further details, please refer to Section 7.2. Note that the rekeying process doesn't involve an acknowledgment by the receiver of a TCP Rekey message. So the sender might send new payload encrypted by the new key right after sending the TCP Rekey message.¶

0           8           16          24
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |        TYPE (0x0X)    |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                    HASHCODE                   |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                       ENC                     |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SIGNATURE                     |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 14: The binary representation of the TCP Rekey message.

SIZE: A 16-bit value containing the length of the message in bytes in network byte order.¶
TYPE: A 16-bit signature type flag in network byte order. The value of this field MUST be 1450.¶
HASHCODE: A 256-bit HMAC-SHA512 hashcode of this TCP Rekey message. The hashcode is computed with the hashcode field initially set to zero and is inserted afterward.¶
ENC: The 256-bit serialized encapsulation result enc of the KEM.¶
Signature: A 512-bit EdDSA signature. The signature is calculated over the data as defined in Figure 15.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |       PURPOSE (0x0X)  |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 SENDER PEER ID                |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 RECEIVER PEER ID              |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                      ENC                      |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                MONOTONIC TIMESTAMP            |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 15: The wire format used for creating the signature of the TCP Rekey message.

SIZE: A 32-bit value containing the length of the signed data in bytes in network byte order.¶
PURPOSE: A 32-bit signature purpose flag in network byte order. The value of this field MUST be 32. It defines the context in which the signature is created so that it cannot be reused in other parts of the protocol, including possible future extensions. The value of this field corresponds to an entry in the GANA "GNUnet Signature Purpose" registry [GANA].¶
SENDER PEER ID: A 256-bit EdDSA public key (pkSed).¶
RECEIVER PEER ID: A 256-bit EdDSA public key (pkRed).¶
ENC: The 256-bit serialized encapsulation result enc of the KEM.¶
MONOTONIC TIMESTAMP: A 64-bit value for the absolute time used by GNUnet, in microseconds and in network byte order.¶

8. HTTP/3 communicator

The HTTP/3 [RFC9114] communicator operates over a bidirectional communication channel, with the client initiating the connection and the server on the receiving end. Once the connection is successfully established, messages are transmitted via POST and GET requests, and all communication is secured using TLS.¶

Upon successfully establishing an HTTP/3 connection, the client MUST immediately transmit its PeerIdentity in a POST. The server will store this PeerIdentity to identify the client. Following this exchange, data exchange between the client and server can proceed.¶

When the client needs to send a message, it uses a POST request to transmit the data to the server. While the server cannot initiate messages independently, it can include data in its response to a client's POST request.¶

To enable the server to proactively send data when the client has no data to transmit, long polling is used. The client sends GET requests to the server, which MAY not immediately respond but instead sets a timer for each request. The server responds either when the timer expires or when there is data to send. Upon receiving a response, the client immediately sends a new GET request to maintain an adequate number of long polling requests with the server.¶

0     8     16    24    32    40    48    56
+-----+-----+-----+-----+-----+-----+-----+-----+
|         SIZE          |         TYPE (0x0X)   |
+-----+-----+-----+-----+-----+-----+-----+-----+
|                 MESSAGE                       |
|                                               |
|                                               |
|                                               |
+-----+-----+-----+-----+-----+-----+-----+-----+

Figure 16: The wire format of exchanged messages.

SIZE: A 16-bit value containing the length of the signed data in bytes in network byte order.¶
: A 16-bit type flag in network byte order. The value of this field MUST be XY. The value of this field corresponds to an entry in the GANA "GNUnet Message Type" registry [GANA].¶
MESSAGE: The message.¶

Example POST:¶

:method: POST
:scheme: https
:authority: PEER'S IP ADDRESS
:path: /
content-type: application/octet-stream
content-length: LENGTH OF MESSAGE

If server no data to send it will respond with HTTP status OK (200). If server has data to send it will respond with HTTP status OK and a response body with a message (Figure 16) and content type application/octet-stream¶

GET request example (long polling):¶

:method: GET
:scheme: https
:authority: PEER'S IP ADDRESS
:path: /

If server no data to send and the long poll times out it will respond with status 204. If server has data to send it will respond with HTTP status OK (200) a response body with a message (Figure 16) and content type application/octet-stream.¶

8.1. Handshake

The public keys in the certificates in use as part of the TLS handshake are not verified or evaluated against a trust store. The initial message by the initiating peer (the HTTP/3 client) will contain the peer identity. In the future, the peer identity should be part of the TLS handshake instead.¶

9. libp2p communicator

The libp2p communicator uses libp2p as a means of communication. libp2p is "a modular networking framework bundled together as a full stack of protocols for peer-to-peer systems" and thus shares the general approach and a similar vision. The first goal would have been to integrate libp2p directly below core as an alternative/addition to the transport service as a first step of trying to get two different p2p networks and their implementations closer to each other. The long term goal would be to make the two networks compatible with each other and on the way learn from the respectively other network and apply the learnings to the respective on network. But it became soon clear that this was too ambitious for the short term and the short term goal was simplified to make libp2p a communicator for transport without loosing the perspective of integrating the two networks further in the future. Interesting insights include performance (in terms of throughput, memory consumption, cpu usage, ...), capabilities (NAT traversal, peer discovery, ...), different approaches to abstract similar underlying concepts, data structures employed and the usage of different schemes itself. If both networks would start to implement compatibilities to each other, both could profit from elements only the other has implemented.¶

In order to address another peer and open a connecting, the libp2p multiaddress of the other peer or peer discovery is required. A libp2p multiaddress follows a very similar idea to a gnunet HELLO. Integrating the one with the other or finding a common abstraction would make a lot of sense in the mid to long term. For a libp2p communicator the easiest is to integrate a libp2p multiaddress just as one of the underlying transport/communicator addresses. The documentation of the libp2p multiaddress can be found here: https://docs.rs/libp2p/latest/libp2p/struct.Multiaddr.html¶

One of the first consideration, when connecting the two networks, is the language. libp2p has implementations in many languages which all have different capabilities. (An overview is provided at https://libp2p.io/implementations.) In order to decide on the implementation mainly two factors have to be considered: Which features are needed and how difficult is the technical connection of the implementation? Depending on the implementation different wrappers and converters have to be implemented before being able to use them.¶

Gnunet HELLO and libp2p multiaddress: The basic element to connect two gnunet peers via their libp2p communicators would be to exchange their libp2p multiaddress as part of a gnunet HELLO. Both represent a collection of different, underlying protocols, the respective implementations communicate over. Due to the similarity in representation, an adaption is needed to either (a) properly embed the one in the other without parsing issues, or (b) being able to directly parse both. For the specification of libp2p multiaddresses see https://github.com/libp2p/specs/tree/master/addressing For just connecting two gnunet nodes via libp2p communicators, the main adaption needed is the compatibility between addressing schemes, as gnunet internally processes (signs with its peer id) the underlying addresses in order to exchange them via its bootstap system.¶

Peer IDs: While gnunet only uses a single cryptographic primitive (EdDSA curve Ed25519) keys, libp2p has the option of using different keys. While not necessary if the connection is only as a communicator below transport, there might be the possibility of a direct compatibility at this level. (Probably a conversion in encoding/representation has still to be done.) The difference being that libp2p peer ids are stable, gnunet peer ids can change. For more see the libp2p spec on peer-ids: https://github.com/libp2p/specs/blob/master/peer-ids/peer-ids.md¶

DHT: The Distributed Hash Tables of both networks are a bit different and as such not directly compatible. libp2p implements Kademlia (https://github.com/libp2p/specs/tree/master/kad-dht), whereas gnunet implements R5N (https://docs.gnunet.org/latest/users/subsystems.html#dht-distributed-hash-table).¶

The GNUnet communicators

Abstract

Status of This Memo

Copyright Notice

Table of Contents

1. Introduction

1.1. Requirements Notation

2. Terminology

3. Notation

4. Overview

5. Cryptographic dependencies

5.1. Elligator DHKEM

5.2. Key derivation