Legal Technology

Why Copilot isn't the answer for legal teams

Q: Is Copilot ever appropriate for legal work?

Copilot can help with general drafting, summarising and productivity tasks, but it is not purpose-built for legal work. Legal teams should be cautious when using it for legal advice, contract review, risk assessment or compliance decisions.

Q: What makes a legal AI tool purpose-built?

A purpose-built legal AI tool is designed around legal workflows, risk controls, auditability, permissions, legal playbooks and organisation-specific context. It helps legal teams manage work with greater confidence, rather than simply generating generic content.

Q: How do I make the case internally to move beyond Copilot for legal work?

Focus on the risks and limitations of using a generalist AI tool for legal work. A dedicated legal AI platform can provide stronger governance, legal context, workflow automation, matter visibility and repeatable risk controls for the business.

Q: What is the risk of relying on horizontal AI for legal work?

Horizontal AI tools may lack the legal context, permissions, audit trails and risk frameworks required for legal work. This can create issues with accuracy, confidentiality, consistency and accountability.

Q: How do we handle the sunk cost of existing Microsoft licences?

Existing Microsoft licences can still support general productivity. A purpose-built legal AI platform should be positioned as a specialist layer for legal work, helping teams manage risk, automate workflows and deliver legal outcomes more effectively.

Ninety-five percent of general counsel expect AI to be central to their work within five years. Only 6.7% have fully operationalised it.

Andrew Mellett

May 26, 2026

CONTENTS

The core problem
The context gap
The risk of doing nothing

Get to know Plexus AI

Take a platform tour

Microsoft Copilot has had more than two years on the market, the backing of over $80 billion in AI infrastructure spend, and access to 450 million Microsoft 365 seats. It is the most aggressively distributed AI product in enterprise software history. And only 3.3% of those seats are paying for it.

That is not a distribution problem. Microsoft has the best enterprise distribution in the world. It is a value problem, and for in-house legal teams specifically, it is a structural one.

The data on Copilot performance makes uncomfortable reading for anyone who has already rolled it out, or is being asked to. Accuracy NPS has fallen to negative 19.8, down from negative 3.5 in just six months. Distrust is the primary reason for churn, cited by 44.2% of lapsed users. Paid market share in the US dropped 39% between July 2025 and January 2026. And perhaps most strikingly, Microsoft's own terms of use describe Copilot as being for entertainment purposes only, with an explicit warning not to rely on it for important advice.

For consumer use, that disclaimer is acceptable. For a general counsel advising on a commercial contract, a regulatory filing, or a board matter, it is not. And yet many legal teams are counting Copilot as their AI strategy.

The core problem: horizontal tools cannot solve vertical problems

Why generalist AI fails at legal work

What it is

Copilot is a horizontal tool. It is designed to function across the full surface area of Microsoft's product suite: Word, Outlook, Teams, Excel, SharePoint. Its architecture reflects a product decision to be broadly useful rather than deeply accurate in any particular domain. That breadth is its commercial selling point. It is also why it fails legal work.

Legal work is not horizontal. It is deeply vertical, with domain-specific requirements that a general-purpose assistant is structurally unsuited to meet. A contract review is not a summarisation task. It requires understanding of clause hierarchy, risk allocation, governing law, standard market positions, and the specific commercial context of the relationship. An advice memo requires knowledge of relevant legislation, regulatory precedent, and the organisation's risk tolerance. A promotional compliance check requires accurate interpretation of 180-plus laws across multiple jurisdictions.

Why it stalls teams

The failure mode for horizontal AI in legal is not dramatic. Lawyers use the tool, find it occasionally useful for low-stakes tasks, and quietly stop relying on it for anything that matters. The tool remains in the technology stack, is counted in the budget, and is pointed to as evidence of AI adoption. But it is not changing how legal work gets done.

The more dangerous failure mode is lawyers using horizontal AI for substantive legal work and not recognising the outputs as unreliable. A contract clause flagged as standard by Copilot may not be standard for your industry, your counterparty, or your risk posture. The AI does not know. It has no reference point for what acceptable looks like in your context. And the output looks authoritative even when it is not.

This creates a specific risk that is worse than not using AI at all: the illusion of having been checked.

What high performing teams do

The legal teams generating real, measurable value from AI have made a clear architectural decision: they use horizontal tools for horizontal tasks and purpose-built tools for legal work.

Separate the use cases explicitly. Copilot or similar tools are appropriate for drafting internal communications, summarising meeting notes, and managing personal productivity. They are not appropriate for contract review, legal advice, regulatory compliance assessment, or any output that will be acted on without expert review.
Set accuracy requirements before selecting tools. Define, for each use case, what accuracy rate is required for the output to be usable. Legal advice outputs typically require very high accuracy thresholds. Horizontal tools cannot reliably meet those thresholds against your organisation's specific legal context.
Evaluate AI against your actual work, not vendor benchmarks. The question is not how the tool performs on synthetic test cases. It is how it performs on the contracts, queries, and compliance questions your team handles every day.
Make the distinction between productivity AI and legal AI explicit and formal. Document it in the AI strategy. Copilot has a role. That role is not legal work.
Evaluate purpose-built platforms on context accumulation as a primary criterion. Ask vendors: after twelve months of use, what does the system know about our organisation that it did not know on day one? How is that knowledge applied to new matters?
Audit the accuracy of AI-assisted outputs over time. Teams that track accuracy rates build evidence-based confidence in purpose-built AI and evidence-based caution about horizontal tools.
Treat the deployment of governed legal AI as a risk management intervention, not just an efficiency play. The business case includes the risk reduction from replacing ungoverned AI use with governed AI use.
Deploy self-service legal AI for the most common business queries. When a marketing manager can get an accurate, governed answer to a standard promotional compliance question in two minutes, they stop finding their own answer in ChatGPT.
Communicate the why to the business. Teams that explain the distinction between governed and ungoverned AI, and provide the governed alternative, see significantly higher uptake than teams that simply announce a new tool.

The context gap: why your organisation's knowledge matters

What purpose-built legal AI does differently

What it is

The fundamental difference between horizontal AI and purpose-built legal AI is context. Copilot knows a great deal about language. It knows nothing specific about your organisation. It does not know your standard positions on limitation of liability. It does not know the risk appetite your board has established. It does not know that you always reject indemnification clauses structured in a particular way. It cannot apply your playbooks, because it has never seen them.

Purpose-built legal AI is designed around the premise that your organisation's knowledge is the most valuable input. It accumulates context across every matter, every clause position, every escalation decision, and applies that accumulated knowledge to the next piece of work. After twelve months of use, it knows your organisation's legal work in a way that no horizontal tool ever can.

Why it stalls teams

Teams that have invested in horizontal AI often resist the transition to purpose-built tools because the sunk cost of the Microsoft licence makes the investment feel covered. The legal function has been pointed to as part of the Copilot rollout. Admitting that it is not meeting legal's needs requires a difficult internal conversation about the distinction between productivity AI and legal AI.

What high performing teams do

Make the distinction between productivity AI and legal AI explicit and formal. Document it in the AI strategy. Copilot has a role. That role is not legal work.
Evaluate purpose-built platforms on context accumulation as a primary criterion. Ask vendors: after twelve months of use, what does the system know about our organisation that it did not know on day one? How is that knowledge applied to new matters?
Audit the accuracy of AI-assisted outputs over time. Teams that track accuracy rates build evidence-based confidence in purpose-built AI and evidence-based caution about horizontal tools.

The risk of doing nothing: AI bush lawyers

What happens when legal does not provide a governed alternative

What it is

The alternative to deploying a governed legal AI platform is not the absence of AI in legal work. Business teams are already using whatever AI is available to them to answer legal questions, review contract language, and generate compliance assessments. The question is not whether AI will be used for legal work in your organisation. It is whether that AI will be governed.

Consumer AI tools are being used right now, by people across your business, to get answers to questions that should be routed through legal. When legal does not provide a sanctioned, accurate, governed alternative, the vacuum is filled by ChatGPT, Copilot, or whatever the individual has access to. The advice generated may be wrong. It will be acted on anyway.

Why it stalls teams

The AI bush lawyer problem is difficult to address through prohibition. Telling the business not to use AI without providing an alternative creates resentment and drives the behaviour underground. The prohibition is unenforceable at scale. The only effective response is to provide a better alternative.

What high performing teams do

Treat the deployment of governed legal AI as a risk management intervention, not just an efficiency play. The business case includes the risk reduction from replacing ungoverned AI use with governed AI use.
Deploy self-service legal AI for the most common business queries. When a marketing manager can get an accurate, governed answer to a standard promotional compliance question in two minutes, they stop finding their own answer in ChatGPT.
Communicate the why to the business. Teams that explain the distinction between governed and ungoverned AI, and provide the governed alternative, see significantly higher uptake than teams that simply announce a new tool.

Source: Plexus Future-Ready General Counsel 2026 Survey, n=150 General Counsels, January 2026. External citations: Thomson Reuters Generative AI in Professional Services Report 2025; ACC/Everlaw GenAI Survey 2025, n=657; Gartner Legal and Compliance Leader research 2025.

Ready to find out where your team sits on the maturity spectrum? Take the AI maturity assessment or explore the Plexus platform.

Questions? We have answers.

Is Copilot ever appropriate for legal work?

Copilot is appropriate for administrative and productivity tasks that do not require domain accuracy: drafting internal emails, summarising meeting notes, formatting documents, and managing calendar and task workflows. It is not appropriate for contract review, legal advice, regulatory compliance assessment, or any output that will be relied upon without expert review. The distinction is not whether the task involves a legal document. It is whether the accuracy of the output matters.

What makes a legal AI tool "purpose-built"?

A purpose-built legal AI is one designed specifically for the accuracy requirements, workflow patterns, and context accumulation needs of in-house legal work. Key characteristics include: training on legal-specific data rather than general internet content; the ability to accumulate and apply your organisation's specific legal context (standard positions, risk tolerance, clause libraries); explainability of outputs with source attribution; and human-in-the-loop design for high-stakes outputs. Purpose-built tools are evaluated against legal accuracy requirements, not general language model benchmarks.

How do I make the case internally to move beyond Copilot for legal work?

The most effective internal argument is a direct comparison of output accuracy on real legal tasks. Run Copilot and a purpose-built legal AI against the same five contract review tasks that your team handles regularly. Document the accuracy of each output against your standard positions and risk criteria. The difference is typically clear and provides a defensible basis for the platform decision.

What is the risk of relying on horizontal AI for legal work?

The primary risk is output that looks authoritative but is inaccurate in ways a non-expert cannot detect. Horizontal AI does not know your organisation's legal context. It will apply generic market knowledge, which may not reflect your risk posture, your standard positions, or the specific regulatory environment in which you operate. Outputs that are acted on without expert review can create legal exposure without the organisation realising it until a dispute arises.

How do we handle the sunk cost of existing Microsoft licences?

The Microsoft licence cost is real and should be treated as covering legitimate productivity use cases. The question is not whether to abandon Copilot entirely but whether to supplement it with purpose-built legal AI for the use cases where accuracy is non-negotiable. Most organisations running purpose-built legal AI alongside Microsoft productivity tools find that the overlap is minimal and the use cases are clearly distinct.

Andrew Mellett

Andrew Mellett is the Founder and CEO of Plexus, a global leader in AI-powered legal technology. Recognised by the Financial Times and Harvard Business Review for his pioneering work in legal innovation, Andrew leads Plexus’s mission to train digital lawyers, helping the world’s top companies streamline legal operations and scale expertise with artificial intelligence.

All your legal work in one AI-powered platform

Faster reviews, self-service for business teams, and smarter compliance in every workflow.

Related resources

Legal Operations & Scale Legal AI

How to actually evaluate legal software (without wasting three months)

Cadell Falconer

As Head of Product at Plexus, Cadell Falconer brin...

Featured Legal AI

Why AI intake is more important than it sounds

Cadell Falconer

As Head of Product at Plexus, Cadell Falconer brin...

Featured Legal AI

AI knows your industry. It doesn't know your organisation. Playbooks change that.

Cadell Falconer

As Head of Product at Plexus, Cadell Falconer brin...

Legal Operations & Scale Legal Technology Legal AI

Why In-House Legal Teams Are Moving Beyond Single-Contract Review

Until recently, that kind of analysis meant one thing: open every contract, review them side by side, and rely...

Cadell Falconer

As Head of Product at Plexus, Cadell Falconer brin...

Unified platform

Get to Know Plexus AI

Experience Plexus Counsel

Find customers like you

Explore potential savings with Plexus

Life at Plexus

Legal AI

Marketing Compliance

Why Copilot isn't the answer for legal teams

Andrew Mellett

The core problem: horizontal tools cannot solve vertical problems

What it is

Why it stalls teams

What high performing teams do

The context gap: why your organisation's knowledge matters

What it is

Why it stalls teams

What high performing teams do

The risk of doing nothing: AI bush lawyers

What it is

Why it stalls teams

What high performing teams do

Questions? We have answers.

Andrew Mellett

All your legal work in one AI-powered platform

Related resources

How to actually evaluate legal software (without wasting three months)

Why AI intake is more important than it sounds

AI knows your industry. It doesn't know your organisation. Playbooks change that.

Why In-House Legal Teams Are Moving Beyond Single-Contract Review

Unified platform

Get to Know Plexus AI

Experience Plexus Counsel

Find customers like you

Explore potential savings with Plexus

Life at Plexus

Legal AI

Marketing Compliance

Why Copilot isn't the answer for legal teams

Andrew Mellett

The core problem: horizontal tools cannot solve vertical problems

What it is

Why it stalls teams

What high performing teams do

The context gap: why your organisation's knowledge matters

What it is

Why it stalls teams

What high performing teams do

The risk of doing nothing: AI bush lawyers

What it is

Why it stalls teams

What high performing teams do

Questions? We have answers.

Andrew Mellett

All your legal work in one AI-powered platform

Related resources

How to actually evaluate legal software (without wasting three months)

Why AI intake is more important than it sounds

AI knows your industry. It doesn't know your organisation. Playbooks change that.

Why In-House Legal Teams Are Moving Beyond Single-Contract Review

Don't miss out on Perspectives by Plexus each month