AI Articles Archive

AI agent security (12) AI agent security (12)

Prompt Injection in LLM Applications: Boundary Failures and Controls

Published: 2026-06-21

An architecture-level mapping of prompt-injection-related LLM application failure modes to OWASP LLM Top 10 and NIST AI risk-management categories.
8 Trust-Boundary Audit Checkpoints for Agentic Systems

Published: 2026-06-09

A practical checklist for auditing eight trust-boundary checkpoints where untrusted artifacts can steer routing, tool use, and write-path actions in chained LLM systems.
Gmail and WhatsApp AI Agents: Private-Message Risks and Tool-Action Controls

Published: 2026-05-28

A technical analysis of the security risks created when AI agents can read, interpret, route, or act on Gmail, WhatsApp, private message threads, attachments, links, and communication workflows.
Connected Apps and MCP Security: Capability Scope and Side-Effect Risk

Published: 2026-03-30

Security analysis of connected apps, external tools, and remote MCP servers as capability, scope, approval, disclosure, and side-effect control surfaces.
Web Retrieval Prompt Injection Boundary in LLM Systems

Published: 2026-03-25

A threat model for browsing-enabled and tool-using LLM systems where retrieved web content can steer routing, tool arguments, follow-up calls, or side effects.
LLM Boundary Assurance Failures: Client-Captured Security Report

Published: 2026-02-22

Client-observed security report on text-only confirmations of privileged state or actions where the public article does not include signed backend audit artifacts. Backend state changes are not independently verified in this public report.
AI Agent Orchestration Loops: Attack Surface and Control-Plane Enforcement

Published: 2026-02-22

How multi-step orchestration (controller) loops change the threat model in tool-using systems, and where to enforce separation, authorization, validation, and budgets to reduce prompt injection, tool misuse, unsafe writes, and unbounded consumption.
LLM Prompt Assembly Security: Separating Policy from Untrusted Content

Published: 2026-02-22

An engineering guide to preventing authority confusion in prompt assembly by separating authoritative policy from untrusted content with typed provenance.
Social Engineering in AI Systems and Decision Pipelines

Published: 2026-02-22

Threat model of social engineering against AI decision pipelines; maps prompt injection to enforcement controls outside the model (PDP/PEP, validation, budgets).
LLM Integration Trust Boundary: Threat Modeling Before AI Agents

Published: 2026-02-22

Why agent-layer threat modeling is incomplete: the first high-leverage control point is the LLM integration trust boundary (before agent frameworks exist).
Request Assembly Threat Model for AI Agents

Published: 2026-02-22

A reviewer-oriented threat model for request assembly in AI assistants: what enters context, what gets prioritized or dropped, and where policy, tool, memory, retrieval, and audit checkpoints should be reviewed.
Tool-Using LLM Systems: Privilege Bleed and Integrity-Signal Failures

Published: 2026-02-22

Two vendor-agnostic control-plane failure patterns—privilege persistence across interaction boundaries and non-enforcing integrity signals—that allow untrusted state to steer tool execution across steps.

AI agent architecture (4) AI agent architecture (4)

Parallel Reasoning in LLM Systems: Orchestration, Not Native Decoding

Published: 2026-04-13

Why multi-path reasoning in LLM systems usually comes from inference-time orchestration rather than ordinary single-pass autoregressive decoding.
Tool Execution in LLM Systems: LLM-Led vs Orchestrator-Led Control

Published: 2026-02-22

A control-plane placement comparison for tool-using LLM systems, covering reliability, observability, latency, cost governance, and security.
LLM Memory Boundaries: Context, Persistence, and Answer Drift

Published: 2026-02-22

A vendor-agnostic model of context construction—what can enter context, what gets used per response, what is retained for later, and which security controls must live outside the prompt.
LLM Capability Gaps: Engineering Substitutes and Residual Risks

Published: 2026-02-22

A practical mapping of human cognitive capabilities to GenAI limitations, engineering substitutes, and residual gaps.

LLM evaluation (7) LLM evaluation (7)

How AI Tools Read Emotional Signals in Text

Published: 2026-06-07

A mechanism-first explanation of textual emotional signals in AI chat and agentic systems: signal interpretation, response adaptation, failure modes, and the authority boundary.
When Human-Like Signals Fail-Cue Misalignment in Clowns and AI-Generated Outputs

Published: 2026-05-13

Why clowns and some AI-generated outputs can feel unsettling: not because they are simply strange, but because they imitate human cues while disrupting the signals people rely on to read emotion, intent, realism, and coherence.
Observed Classification Layers in ChatGPT

Published: 2026-04-29

A client-side black-box analysis of observed ChatGPT classification artifacts, separating user access, prompt demand, and capability allocation.
Theory of mind in LLMs — what benchmarks test (and what they don’t)

Published: 2026-02-22

Evidence-anchored overview of how ToM is defined in psychology, how it is operationalized for LLM evaluation, and what current results do and do not justify.
Sycophancy in LLM Assistants

Published: 2026-02-22

A technically grounded explanation of sycophancy: what it is, what evidence supports, how preference optimization can produce it, and how release practice can reduce it.
Orders of Intentionality and Recursive Mindreading Definitions and Use in LLM Evaluation

Published: 2026-02-22

A precise reference for nested mental-state attribution (“orders of intentionality” / “recursive mindreading”) and how these constructs are operationalized in evaluations of humans and LLMs—without implying mechanism-level Theory of Mind.
Fluency Is Not Factuality Why LLMs Can Sound Right and Be Wrong

Published: 2026-02-22

Why fluent LLM outputs can still be wrong, and how to enforce evidence-locked answers (retrieval + provenance + fail-closed gates).

Prompt engineering (4) Prompt engineering (4)

File Upload Is Not Full-File Review

Published: 2026-05-19

Why AI summaries, edits, extractions, and content drafts can fail before generation begins: file upload, source retrieval, active context, and full-file review are different things.
Vibe Coding Risk: Engineering Ownership and Review Controls

Published: 2026-05-12

Vibe coding is not risky because AI can generate code. The risk starts when AI-generated code is approved without sufficient comprehension, review, security validation, and long-term ownership.
Using ChatGPT Effectively at Work: A Practical Guide

Published: 2026-04-03

A practical guide to choosing the right ChatGPT layer for work: modes, search, deep research, agent mode, personalization, memory, and projects.
Prompt Engineering Guide for Daily Work (Deep Dive)

Published: 2026-02-22

A deep dive into why prompts fail in daily work, how to design evidence-bounded prompt specifications (grounded outputs), and how to evaluate them.

Get new AI resources by email