AI Coding Assistants and HIPAA: Using Copilot and Cursor Safely in Healthcare

Q: Is GitHub Copilot HIPAA compliant?

No tool is HIPAA compliant on its own; compliance depends on how it is used. As of mid-2026, general-purpose Copilot tiers are not marketed as HIPAA-eligible and typically do not come with a Business Associate Agreement. Use such a tool only if you ensure it never receives PHI, or if the vendor will execute a BAA for your tier. Verify current contract terms with the vendor.

Q: How do we let developers use AI tools without violating HIPAA?

Keep PHI out of the codebase with synthetic data and automated scanning, disable model training and telemetry, prefer enterprise tiers with contractual data-handling commitments, and govern every AI tool through your Security Risk Analysis and third-party risk process.

AI coding assistants like GitHub Copilot, Cursor, and Amazon Q have moved from novelty to default inside healthcare engineering teams. They speed up development, but they also introduce a new path for protected health information (PHI) to leave a controlled environment — through prompts, code context, logs, and model training pipelines. HIPAA does not ban these tools. It does require that any service touching electronic PHI be governed like any other vendor: under a Business Associate Agreement, inside a documented risk analysis, with technical controls that match the risk. This guide explains where the exposure actually is and how a healthcare organization can let developers use AI assistants without creating an unmanaged compliance gap.

Where PHI actually leaks into a coding assistant

The intuitive worry — “a developer pastes a patient record into the chat box” — is the least common failure. The realistic exposure is quieter. AI coding assistants send surrounding code context to a remote model to generate suggestions. If a repository contains real PHI in test fixtures, seed data, hard-coded sample records, log snippets, or database migration files, that data can be transmitted as context without anyone consciously “sharing” it. Debugging sessions are another route: an engineer pastes a stack trace or a failing query that happens to include a real patient identifier. Inline comments describing a production incident (“patient 4471 saw duplicate claims”) become training-adjacent text. None of these feel like disclosing PHI, which is precisely why they evade review.

The governing question under HIPAA is not whether a human intended to share PHI. It is whether electronic PHI was transmitted to or maintained by a third party. The HHS Office for Civil Rights treats a vendor that creates, receives, maintains, or transmits PHI on behalf of a covered entity or business associate as a business associate itself (45 CFR 160.103). A coding assistant that ingests a repository containing PHI fits that definition, regardless of developer intent.

The BAA question: most general-purpose assistants don’t offer one

Under the HIPAA Privacy and Security Rules, a covered entity may only disclose PHI to a business associate with documented “satisfactory assurances” captured in a written contract (45 CFR 164.502(e) and 45 CFR 164.308(b)). The practical test for any AI coding assistant is simple: will the vendor sign a Business Associate Agreement covering the specific tier you are using?

As of mid-2026, the major general-purpose coding assistants are not marketed as HIPAA-eligible services and typically do not sign BAAs for their standard developer tiers. Enterprise tiers may offer stronger data-handling commitments — no prompt retention, no training on customer code, regional processing — but a data-protection addendum or a “we don’t train on your data” promise is not the same as a BAA, and an organization should never assume one implies the other. If a vendor will not execute a BAA, the only compliant posture is to ensure the tool never receives PHI at all. That is a technical and process control, not a checkbox.

This is the same diligence gap covered in our guide to why a signed BAA isn’t the same as a vendor risk assessment: the contract allocates liability, but it does not prove the tool is safe to use. Both are required.

A practical control model for development teams

1. Keep PHI out of the codebase

The strongest control is to make sure repositories never contain real PHI. Use synthetic test data, scrub production exports before they reach development, and add automated secret- and PHI-scanning to CI so a real identifier in a fixture fails the build. If the assistant only ever sees synthetic data, the BAA question becomes far less acute — though governance documentation is still required.

2. Configure the tool, don’t trust the default

Disable model training on your content where the vendor allows it, restrict the assistant to approved repositories, turn off telemetry that transmits code snippets, and prefer enterprise tiers that contractually exclude your data from training. Document each setting — an auditor will ask how you know training is disabled, and “the vendor’s marketing page said so” is a weak answer without a contract behind it.

3. Govern AI tools through your existing program

An AI coding assistant is a vendor and an information system. It belongs in your asset inventory, your HIPAA Security Risk Analysis, and your third-party risk management process. Shadow adoption — where individual engineers install assistants the security team never evaluated — is the single biggest source of unmanaged AI risk in healthcare today. A workable AI governance program names which tools are approved, for which data, under which configuration, and reviews that list on a schedule.

How this connects to broader AI governance

Coding assistants are one instance of a larger pattern: powerful, easy-to-adopt AI tools that quietly become business associates the moment they touch PHI. The same logic applies to AI scribes, chat assistants, and analytics copilots. Organizations that handle this well do not treat each tool as a one-off; they fold AI into a single governance model. Our overview of AI governance for healthcare and the companion piece on choosing HIPAA-compliant AI tools cover the policy side that makes per-tool decisions repeatable rather than ad hoc.

The bottom line: developers can use AI coding assistants in a healthcare organization, but only inside a deliberate boundary. Either the tool never sees PHI, or there is a BAA and a risk assessment that say it may. Anything in between is the gap where a breach — and an OCR investigation — starts.

Talk to Medcurity

Medcurity helps healthcare organizations bring AI tools — coding assistants included — under a documented HIPAA governance program, from risk analysis to vendor management. Explore Medcurity solutions to see how.

Frequently asked questions

Is GitHub Copilot HIPAA compliant?

No tool is “HIPAA compliant” on its own — compliance depends on how it is used. As of mid-2026, general-purpose Copilot tiers are not marketed as HIPAA-eligible and typically do not come with a Business Associate Agreement. You can use such a tool in a healthcare organization only if you ensure it never receives PHI (synthetic test data, PHI scanning in CI, no real identifiers in code or logs), or if the vendor will execute a BAA for your tier. Verify the current contract terms directly with the vendor before relying on any tool.

Can an AI coding assistant be a HIPAA business associate?

Yes. Under 45 CFR 160.103, any service that creates, receives, maintains, or transmits PHI on behalf of a covered entity or business associate meets the definition of a business associate. If your repositories, logs, or debugging sessions expose real PHI to the assistant, the vendor is functioning as a business associate and a BAA is required.

How do we let developers use AI tools without violating HIPAA?

Keep PHI out of the codebase using synthetic data and automated scanning, configure the tool to disable training and telemetry, prefer enterprise tiers with contractual data-handling commitments, and govern every AI tool through your Security Risk Analysis and third-party risk process. Approve specific tools for specific data under specific configurations, and review that list regularly to prevent shadow adoption.