RFC 4103: RTP Payload for Text Conversation

2026-05-01

RFC: RFC 4103

Published: 2005

Authors: Gunnar Hellström, Paul Jones

You type a message in a chat app and hit send. Your recipient reads the whole thing at once. But imagine a phone call where every keystroke you type appears on the other person's screen in real time, character by character, as you think and compose — no send button required. That is Real-Time Text (RTT), and RFC 4103 is the protocol that makes it work over IP networks.

The problem it solves: For deaf and hard-of-hearing people, text-based communication over telephone networks has been essential since the 1960s, when TTY/TDD devices first appeared. These devices transmitted characters one at a time over analog phone lines, enabling conversational text with the same immediacy as voice. When telephony moved to IP (VoIP, SIP, IMS), there was no equivalent mechanism. Store-and-forward messaging like SMS or IM wasn't a substitute — it broke the real-time, conversational flow that TTY users depended on. RFC 4103 bridged that gap.

How it works: The RFC defines an RTP payload format for carrying ITU-T T.140 text — a UTF-8-based protocol for multimedia conversational text. The key design decisions are:

Why it matters today: RFC 4103 is far from a dead spec. It is mandated in multiple regulatory frameworks:

If you've ever seen the "RTT" option in your phone's call settings and wondered what it was, this is the answer. It's also a fascinating case study in protocol design for accessibility: the authors had to balance packet efficiency, real-time latency requirements, and reliability in a way that voice codecs never worry about, because every single character matters.

The spec is also a reminder that "real-time" means different things in different contexts. For voice, 20ms packet intervals are standard. For text, 300ms is perfectly fine — human typing speed is the bottleneck, not the network. Matching protocol parameters to human factors rather than raw capability is an underrated design skill.

Why it matters: RFC 4103 is the protocol that lets deaf and hard-of-hearing users make real-time text "phone calls" over IP networks — it's mandated by regulators, built into every modern smartphone, and is a masterclass in designing protocols around human accessibility needs rather than pure engineering metrics.

All newsletters