{
"lesson": "27-finops-llms",
"title": "FinOps for LLMs: Unit Economics and Multi-Tenant Attribution",
"questions": [
{
"stage": "pre",
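"question": "Why does traditional FinOps break down for LLM spend?",
"options": [
"LLMs cost nothing",
"LLM bills are always free",
"Cloud providers refuse to itemize the bill",
"Cost accrues per token transaction rather than per hour of resource uptime; tags do not propagate automatically from API calls, so you must stamp user/task/tenant at the call site"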
],
"correct": 3,
"explanation": ""
},
{
"stage": "check",
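"question": "Which three attribution dimensions does this lesson require you to instrument from day one?",
"options": [
"Per user (user_id), per task (task_id + route), per tenant (tenant_id)",
"Vendor, model, API version",
"Region, availability zone, data center",
"GPU, CPU, memory"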
],
"correct": 0,
"explanation": ""
},
{
"stage": "check",
"question": "Which four token layers should be broken out in cost attribution?",
"options": [
"prompt, tool, memory, response",
"Input, output, network, disk",
"Cache, model, gateway, observability",
"GPU, CPU, memory, storage"
],
"correct": 0,
"explanation": ""
},
{
"stage": "check",
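"question": "In the enforcement ladder, what triggers the kill-switch?",
"options": [
"A tenant spend z-score > 4 relative to baseline; the tenant is automatically paused and on-call is paged",
"P50 latency > 2 seconds",
"Any 5xx response",
"Spend exceeding $1 within one minute"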
],
"correct": 0,
"explanation": ""
},
{
"stage": "post",
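"question": "Which unit metric does this lesson recommend in place of 'dollars per million tokens'?",
"options": [
"Cost per gateway",
"Cost per GPU-hour",
"Cost per second",
"Cost per product outcome (for example, cost per support ticket resolved, cost per article generated, cost per agent task completed)"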
],
"correct": 3,
"explanation": ""
},
{
"stage": "post",
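"question": "Which attribution pattern does this lesson call the highest-fidelity approach used by mature teams?",
"options": [
"Sampling and extrapolation",
"Model-based allocation",
"Tag-and-aggregate only",
"Telemetry joiner: correlates traces with billing records via trace ID"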
],
"correct": 3,
"explanation": ""
}
]
}