Question 1

What does an audit deliver?

Accepted Answer

A Technical Assessment Report, a board-ready executive summary, and a full evidence package that gives, for every finding, the observed frequency, confidence interval, trial count, and representative evidence. You also get three compliance dossiers (EU AI Act Article 15, ISO/IEC 42001, NIST AI RMF), prioritized remediation guidance, and one bounded re-test within 30 days. The default format is PDF; HTML, JSON, and Markdown are available on request.

Question 2

What can we tell our board and customers after the audit?

Accepted Answer

You get a board-ready executive summary written to be quoted, and a pre-approved sentence you're licensed to publish: "[Company]'s AI assistant is independently security-tested by TestMy.AI against 800+ adversarial scenarios spanning the OWASP LLM and Agentic Top 10, with quarterly re-testing." We report what was tested and found. We don't certify systems as "safe", and neither should you.

Question 3

Who is the report written for?

Accepted Answer

Security teams validating AI before production; compliance officers preparing evidence for ISO/IEC 42001 or NIST filings; legal teams preparing EU AI Act dossiers; procurement teams responding to enterprise security reviews; and executives who need to report on AI risk upward. The report has a section for each audience.

Question 4

Will testing damage production?

Accepted Answer

Tests run against the endpoint you designate, whether staging, shadow, or production, at a throttle you set. We coordinate test windows, respect your rate limits, and pause on the first 5xx pattern. No destructive payloads, no data exfiltration beyond what your model itself surfaces.
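To make the throttle-and-pause behaviour concrete, here is a minimal sketch of a rate-limited test loop that stops when server errors start repeating. It is an illustration under stated assumptions, not TestMy.AI's harness: the `send` callable, the `max_per_second` throttle parameter, and the reading of "5xx pattern" as three consecutive 5xx responses are all hypothetical.

```python
import time
from typing import Callable, Iterable


def run_throttled(
    payloads: Iterable[str],
    send: Callable[[str], int],  # hypothetical: sends one test, returns the HTTP status
    max_per_second: float,       # throttle chosen by the client
    pause_after_5xx: int = 3,    # assumption: "pattern" means N consecutive 5xx responses
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Send tests one at a time at a fixed rate and stop on repeated server errors."""
    if max_per_second <= 0:
        raise ValueError("max_per_second must be positive")
    interval = 1.0 / max_per_second
    consecutive_5xx = 0
    statuses = []
    for payload in payloads:
        status = send(payload)
        statuses.append(status)
        if 500 <= status <= 599:
            consecutive_5xx += 1
            if consecutive_5xx >= pause_after_5xx:
                # Stop sending so the endpoint can recover before testing resumes.
                return {"paused": True, "sent": len(statuses), "statuses": statuses}
        else:
            consecutive_5xx = 0
        sleep(interval)  # keep the request rate at or below the agreed throttle
    return {"paused": False, "sent": len(statuses), "statuses": statuses}
```

A single isolated 5xx resets nothing on the target and does not stop the run in this sketch; only a repeated run of errors does. The real threshold is whatever is agreed during scoping.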

Question 5

What access do you need?

Accepted Answer

Just an endpoint URL and authentication. No source code, no model weights, no infrastructure access. The audit is performed black-box, exactly the way an external attacker would see your AI.

Question 6

If we re-run a finding ourselves, will we see the same result?

Accepted Answer

Not necessarily, and that's expected. AI systems are non-deterministic; a finding that occurs 35% of the time may not reproduce on one manual attempt. That's precisely why every attack is run multiple times and reported as a frequency rather than a yes/no. Every finding ships with its trial count and observed rate so your team knows exactly what to expect on re-runs.
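The arithmetic behind this can be sketched in a few lines. The example below uses the standard Wilson score interval for a proportion and the complement rule for repeated attempts; the trial count of 20 used in the note after the code is an illustrative assumption, and the report's own interval method may differ.

```python
import math


def wilson_interval(successes: int, trials: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for an observed success rate (z = 1.96 is about 95%)."""
    if trials <= 0 or not 0 <= successes <= trials:
        raise ValueError("need trials > 0 and 0 <= successes <= trials")
    p = successes / trials
    denom = 1 + z * z / trials
    centre = (p + z * z / (2 * trials)) / denom
    half = z * math.sqrt(p * (1 - p) / trials + z * z / (4 * trials * trials)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)


def chance_of_reproducing(rate: float, attempts: int) -> float:
    """Probability of seeing the behaviour at least once in `attempts` independent tries."""
    if not 0.0 <= rate <= 1.0 or attempts < 0:
        raise ValueError("rate must be in [0, 1] and attempts must be >= 0")
    return 1 - (1 - rate) ** attempts
```

For a finding observed 7 times in 20 trials (35%), `wilson_interval(7, 20)` gives a range of roughly 18% to 57%, and `chance_of_reproducing(0.35, 1)` is 0.35: a single manual attempt misses it about two times in three. Five attempts raise the chance of seeing it at least once to about 88%, assuming the attempts are independent.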

Question 7

What does it mean when a test shows no findings?

Accepted Answer

Not "safe", and not "passed". It means the attack was not observed to succeed across N attempts, and testing at that depth reliably detects behaviours occurring more often than roughly 1 in 7. Rarer intermittent behaviour cannot be excluded, and we say so rather than implying a clean bill of health.
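The relationship between attempt count and the rarest behaviour a test can reliably catch follows from basic probability. The sketch below assumes independent attempts and a 95% detection target; both are assumptions for illustration, since the answer above does not state the value of N or the confidence level used.

```python
def detection_probability(true_rate: float, attempts: int) -> float:
    """Chance that a behaviour with the given true rate shows up at least once."""
    if not 0.0 <= true_rate <= 1.0 or attempts < 0:
        raise ValueError("true_rate must be in [0, 1] and attempts must be >= 0")
    return 1 - (1 - true_rate) ** attempts


def smallest_detectable_rate(attempts: int, confidence: float = 0.95) -> float:
    """Lowest true rate that would be seen at least once with the given confidence."""
    if attempts <= 0 or not 0.0 < confidence < 1.0:
        raise ValueError("attempts must be > 0 and confidence must be in (0, 1)")
    # Solve 1 - (1 - rate) ** attempts = confidence for rate.
    return 1 - (1 - confidence) ** (1 / attempts)
```

With an illustrative 20 attempts, `smallest_detectable_rate(20)` is about 0.14, which is close to 1 in 7. A behaviour that occurs 1 time in 50 would be seen at least once in only about a third of such runs, which is why a result of no findings cannot rule out rare intermittent behaviour.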

Question 8

Do you test AI agents, not just chatbots?

Accepted Answer

Yes. The OWASP Agentic Top 10 (ASI01–ASI10) is covered via adaptive, multi-iteration attack chains on every audit. Full agentic coverage requires tool-calling, RAG, or memory on the target; tests that don't apply to your architecture are marked N/A, not padded into the score.
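One way to picture "marked N/A, not padded into the score" is a summary that removes non-applicable tests from the denominator. The sketch below is hypothetical: the `Outcome` labels and the `finding_share` figure are invented for illustration, and the actual scoring method is not described here.

```python
from enum import Enum


class Outcome(Enum):
    FINDING = "finding"
    NO_FINDING = "no_finding"
    NOT_APPLICABLE = "n/a"


def summarize(outcomes: dict[str, Outcome]) -> dict:
    """Summarize results, counting only tests that apply to the target architecture."""
    applicable = {k: v for k, v in outcomes.items() if v is not Outcome.NOT_APPLICABLE}
    findings = sum(1 for v in applicable.values() if v is Outcome.FINDING)
    return {
        "applicable": len(applicable),
        "not_applicable": len(outcomes) - len(applicable),
        "findings": findings,
        # N/A tests are excluded from the denominator rather than counted as passes.
        "finding_share": findings / len(applicable) if applicable else None,
    }
```

For a target with no memory or tool-calling, two of four categories might be N/A. With one finding among the two applicable categories, the share is 1 of 2, not 1 of 4, so the N/A entries do not dilute the result.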

Question 9

Do you verify what happened in our backend?

Accepted Answer

No. Testing is black-box: we observe what your endpoint returns, not what your systems did. Where the model claims to have performed an action, we report it as a model claim with supporting or contradicting signals, never as a confirmed database event.

Question 10

Is this a certification?

Accepted Answer

No. TestMy.AI is an independent technical assessor, not a certification body. The deliverable is a Technical Assessment Report with evidence designed to support your compliance filing alongside qualified legal counsel. The EU AI Act conformity-assessment framework for Article 15 is still being established; no one yet holds formal certification authority.

Question 11

What about data retention?

Accepted Answer

Test artefacts are retained encrypted for 90 days post-delivery, then destroyed on written request, or extended for your audit-trail retention period. We sign a mutual NDA before scoping and never use client data to train or tune models.

Question 12

Which frameworks does it map to?

Accepted Answer

Included by default, on every audit: EU AI Act Article 15, ISO/IEC 42001, NIST AI RMF, and OWASP. Every finding is pre-mapped to all four the moment it's written up, so the same evidence answers the regulator, the procurement team, and the security review, if and when you need it.

Question 13

How long does it take?

Accepted Answer

For a single audit of one endpoint, the report is delivered within 10 business days of scoping. Rush turnaround shortens that window for a 30% surcharge.

Question 14

What happens after we patch?

Accepted Answer

One re-test is included within 30 days, same version, failed tests only. You get a verified fix rate and the re-issued evidence. Extra re-tests are 2,000 USD during the founding window, 2,500 USD standard.
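A fix rate over a failed-tests-only re-test can be computed as the share of previously failing tests that no longer fail. The sketch below is one plausible definition, not the report's stated formula; the function name and test identifiers are hypothetical.

```python
def verified_fix_rate(failed_before: set[str], still_failing: set[str]) -> float:
    """Share of previously failing tests that no longer fail on re-test."""
    if not failed_before:
        raise ValueError("no failed tests to re-test")
    unexpected = still_failing - failed_before
    if unexpected:
        # The re-test covers failed tests only, so anything else signals bad input.
        raise ValueError(f"not part of the re-test scope: {sorted(unexpected)}")
    fixed = failed_before - still_failing
    return len(fixed) / len(failed_before)
```

If four tests failed in the original audit and one still fails after patching, the rate is 0.75. Because results are frequency-based, "no longer fails" here means the attack was not observed to succeed across the re-test's repeated attempts.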

Question 15

Do you offer ongoing testing?

Accepted Answer

Yes. Continuous Assurance is the same full audit, quarterly, plus regression re-testing on material change (new model version, prompt change, new tool), with dossiers and the board summary kept current. A point-in-time audit decays in weeks; quarterly re-testing is what keeps the statement "our AI is independently tested" true.

Ship AI with proof, not promises.

A binder your board, your security team and your engineers can all read.

The Technical Assessment Report

The board-ready executive summary

An evidence package, quantified

Three compliance dossiers, pre-mapped

A bounded, fix-verification re-test

25 categories. 800+ ways in. Every applicable one tested, repeatedly.

Also mapped to four frameworks, included, not upsold.

EU AI Act, Article 15

ISO/IEC 42001

NIST AI RMF 1.0

OWASP LLM & Agentic Top 10

The report your board and your engineers both open.

Every finding, with its frequency.

One audit to start. Continuous coverage once you ship again.

AI Security Audit

Continuous Assurance

The plain version.

An audit before the next board review.