← Back to Contents
Chapter A

LT Evaluation Checklists

Task Fitness

  • One-sentence answer on top
  • Runnable artifact included
  • Verification steps listed
  • Inputs and sources named
  • Risk and rollback described
  • Next action proposed

Output Quality

  • Tight vocabulary for the domain
  • Fluency without filler
  • Compression to what matters
  • Clear boundary between facts and inference

Safety and Access

  • Least-privilege access used
  • Changes logged with diffs
  • Restore plan verified