this is not a stream of bytes, it's a stream of access units.
Input buffers (for decoders) and Output buffers (for encoders) contain encoded data according to the format’s type. For video types this data is all the encoded data representing a single moment in time, for audio data this is slightly relaxed in that a buffer may contain multiple encoded frames of audio. In either case, buffers do not start and end on arbitrary byte boundaries, this is not a stream of bytes, it’s a stream of access units.
“这不是字节流,而是访问单元流”
H.264 视频流
text
字节流视角: [00 00 00 01 67 … 00 00 00 01 68 … 00 00 00 01 65 …]
访问单元视角: [SPS] [PPS] [I帧] [P帧] [P帧] …
每个 [ ] 都是一个访问单元
必须以 00 00 00 01 或 00 00 01 开始
包含完整的编码帧信息