Anthropic ★★ Frequent Hard Session Streaming

A21 · Design Claude Chat Service

Verified source

Prompt: "Design Claude Chat Service." — interviewing.io. Credibility C/D.

Core concerns

  • Session management (long-lived conversations with token limits).
  • Streaming output over SSE / WebSocket.
  • Token-level billing (metered at the worker after tokenization).
  • Log aggregation (conversation + safety annotations for audit).
  • Safety pipeline integration (input filter, output filter, content policy).
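
The first two concerns can be illustrated with a minimal Python sketch. It assumes a sliding-window policy for keeping a conversation under its token limit (one option among several, such as summarizing older turns) and frames output chunks as Server-Sent Events. `Message`, `count_tokens`, `trim_history`, `sse_frames`, and the `done` event name are hypothetical names chosen for illustration, and the whitespace word count stands in for a real tokenizer.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List


@dataclass
class Message:
    role: str  # "user" or "assistant"
    text: str


def count_tokens(text: str) -> int:
    # Assumption: one token per whitespace-separated word.
    # A real service would call the model's tokenizer here.
    return len(text.split())


def trim_history(messages: List[Message], budget: int) -> List[Message]:
    """Keep the most recent messages whose combined token count fits the budget."""
    kept: List[Message] = []
    used = 0
    for msg in reversed(messages):  # walk from newest to oldest
        cost = count_tokens(msg.text)
        if used + cost > budget:
            break  # everything older than this message is dropped
        kept.append(msg)
        used += cost
    kept.reverse()  # restore chronological order
    return kept


def sse_frames(chunks: Iterable[str]) -> Iterator[str]:
    """Wrap each output chunk in an SSE frame, then emit a completion event."""
    for chunk in chunks:
        # An SSE data field cannot contain a raw newline, so each line of
        # the chunk becomes its own "data:" line within the same frame.
        body = "".join(f"data: {line}\n" for line in chunk.split("\n"))
        yield body + "\n"  # a blank line terminates the frame
    # Assumption: the client treats an event named "done" as end of stream.
    yield "event: done\ndata: {}\n\n"
```

Dropping the oldest turns first is the simplest policy to reason about in an interview; the trade-off is that early context is lost outright, which is why summarization or pinned system messages are common follow-up topics.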

Architecture

flowchart LR
  C[Client] --> GW[Gateway]
  GW --> SESS[Session Service]
  SESS --> CONV[(Conv Store)]
  GW --> SAFE[Safety Input Filter]
  SAFE --> INF[Inference]
  INF --> SAFE2[Safety Output Filter]
  SAFE2 --> GW
  GW --> METER[Usage Metering]
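
The request path in the flowchart can be sketched in Python as follows. This is a toy under stated assumptions, not a production design: `BLOCKED_TERMS`, `passes_policy`, `echo_model`, `Usage`, and `Gateway` are hypothetical names, the substring check stands in for a real safety classifier, and the choice not to bill blocked prompts is an assumption rather than something the source states.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterator, List

BLOCKED_TERMS = {"forbidden"}  # hypothetical content policy


def passes_policy(text: str) -> bool:
    # Stand-in for a safety classifier: reject text containing a blocked term.
    return not any(term in text.lower() for term in BLOCKED_TERMS)


def echo_model(prompt: str) -> Iterator[str]:
    # Placeholder for the inference tier: streams the prompt back word by word.
    yield from prompt.split()


@dataclass
class Usage:
    input_tokens: int = 0
    output_tokens: int = 0


class Gateway:
    def __init__(self, model: Callable[[str], Iterator[str]] = echo_model) -> None:
        self.model = model
        self.usage: Dict[str, Usage] = {}  # per-user metering counters

    def handle(self, user_id: str, prompt: str) -> List[str]:
        usage = self.usage.setdefault(user_id, Usage())
        # Safety input filter runs before any inference cost is incurred.
        if not passes_policy(prompt):
            return ["[blocked by input filter]"]  # assumption: not billed
        usage.input_tokens += len(prompt.split())
        out: List[str] = []
        for token in self.model(prompt):
            usage.output_tokens += 1  # metered as each token is generated
            # Safety output filter: stop the stream at the first violation.
            if not passes_policy(token):
                out.append("[redacted]")
                break
            out.append(token)
        return out
```

For example, `Gateway().handle("u1", "hello there world")` returns the three echoed words and records three input and three output tokens for `u1`. Filtering token by token keeps time-to-first-token low, but a real output filter would likely need to examine a sliding window of text, since a violation can span several tokens.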

See A11 for inference depth and A19 for prompt versioning.

Related study-guide topics