AI実行：ソフトウェアにおける「テキストとしてのAI」の終焉

ソフトウェア開発における人工知能の状況は、大きな変革期を迎えています。過去2年間、AIとの対話における主要なパラダイムは、単純な交換を伴うものでした。つまり、テキストを入力し、テキスト出力を受け取り、次に手動で次の行動を決定するというものです。この「テキストとしてのAI」の時代は、画期的なものではありましたが、今やより動的で統合されたアプローチに道を譲りつつあります。登場するのはGitHub Copilot SDKであり、AIを実行としてのインターフェースとする新時代を告げています。

本番環境のソフトウェアは本質的に実行に関わるものです。ステップの計画、ツールの呼び出し、ファイルの変更、エラーからの回復、制約への適応などです。これらは、単なるテキスト生成では完全に包含できない、複雑なマルチステップ操作です。GitHub Copilot SDKは、このギャップに直接対処し、GitHub Copilot CLIを支える強力な実行レイヤーを、あらゆるソフトウェアアプリケーション内でプログラム可能な機能として利用可能にします。これにより、チームは実稼働環境でテスト済みのプランニングおよび実行エンジンをシステムに直接組み込むことができ、AIを活用したアプリケーションのアーキテクチャと運用方法が根本的に変わります。

静的なスクリプトから適応型エージェントワークフローへ

従来のソフトウェア開発は、反復的なタスクを自動化するためにスクリプトやグルーコードに長らく依存してきました。固定されたシーケンスには効果的ですが、これらのソリューションは、コンテキストのニュアンス、実行中の変更、または堅牢なエラー回復の必要性に直面するとすぐに脆くなります。開発者は、エッジケースをハードコーディングしたり、特注のオーケストレーションレイヤーを構築したりすることが多く、これは時間と労力を要し、しばしば持続不可能な努力です。

GitHub Copilot SDKは、あらゆるステップを明示的にエンコードするのではなく、意図を委譲することを可能にすることで、これらの制約からアプリケーションを解放します。「このリポジトリをリリース用に準備する」必要があるアプリケーションを想像してみてください。厳格なスクリプトの代わりに、Copilot SDKはAIエージェントに次のことを可能にします。

リポジトリの構造とコンテンツを探索する。
ドキュメントの更新、テストの実行、バージョン番号の更新など、必要なステップを計画する。
必要に応じてファイルを変更する。
システム環境内でコマンドを実行する。
いずれかのステップが失敗した場合や、新しい情報が出現した場合に動的に適応する。これらすべては、事前定義された境界と権限内で動作します。

この変化は、現代のソフトウェアシステムにとって極めて重要です。アプリケーションがスケールし、環境が進化するにつれて、固定されたワークフローは失敗しやすくなります。Copilot SDKによって強化されたエージェント実行は、ソフトウェアが適応し、自己修正することを可能にし、複雑なオーケストレーションをゼロから再構築する絶え間ない負担なしに、可視性と制約を維持します。これにより、AIは開発ライフサイクルにおける能動的でインテリジェントな参加者となり、基本的なコード補完を超えてインテリジェントなタスク自動化へと移行します。これらの複雑なワークフローがどのように保護されているかについての詳細は、GitHub Agentic Workflowsのセキュリティアーキテクチャをご覧ください。

信頼性の高いAIのための構造化されたコンテキスト：Model Context Protocol (MCP)

「テキストとしてのAI」の時代におけるよくある落とし穴は、あまりにも多くのシステム動作とデータをAIプロンプトに押し込もうとすることでした。一見すると便利そうですが、ロジックをテキストでエンコードすると、ワークフローのテスト、推論、進化が困難になります。時間が経つにつれて、これらの凝ったプロンプトは、適切な構造化されたシステム統合の脆い代替品となります。

GitHub Copilot SDKは、**Model Context Protocol (MCP)**を活用し、コンテキストに対する構造化された構成可能なアプローチでこれに対処します。MCPを使用すると、開発者は次のことができます。

AIが呼び出すことができるドメイン固有のツールまたはエージェントスキルを定義する。
これらのツールとスキルをMCPを介して公開する。
実行エンジンがランタイムにコンテキストを動的に取得することを可能にする。

これは、サービス所有データ、APIスキーマ、過去の意思決定記録、依存関係グラフ、または内部APIなどの重要な情報が、もはやプロンプトに無理やり詰め込む必要がないことを意味します。代わりに、エージェントはプランニングおよび実行フェーズ中にこれらのシステムに直接アクセスします。たとえば、問題解決を担当する内部エージェントは、サービス所有権を自動的にクエリし、関連する履歴データを取得し、影響評価のために依存関係グラフを確認し、内部APIを参照して解決策を提案することができます。これらすべては、定義された安全制約に従いながら行われます。このアプローチは、コンテキストの注入が複雑になりがちなOpenAI APIでのプロンプトエンジニアリングのベストプラクティスの課題とは対照的です。

なぜこれが重要なのか：信頼性の高いAIワークフローは、根拠に基づいた、権限付与された、構造化されたコンテキストの上に構築されます。MCPは、エージェント実行が実際のツールと実際のデータ上で動作することを保証し、テキストベースのプロンプトエンジニアリングに伴う推測と脆さを排除する重要な基盤を提供します。

インフラとしてのAI：IDEを超えた実行の組み込み

歴史的に、開発者向けのAIツールの大半は統合開発環境（IDE）に限定されていました。コーディングには非常に価値がありますが、現代のソフトウェアエコシステムは単一のエディタをはるかに超えて広がっています。チームは、デスクトップアプリケーション、社内運用ツール、バックグラウンドサービス、SaaSプラットフォーム、イベント駆動型システムなど、様々な環境でエージェント機能を必要としています。

Copilot SDKはこれらの境界を打ち破り、実行をアプリケーション層の機能にします。これは、システムがイベント（ファイル変更、デプロイメントトリガー、ユーザーアクションなど）をリッスンし、Copilotをプログラム的に呼び出してエージェントワークフローを開始できることを意味します。プランニングと実行のループは、独立したインターフェースや開発者ツールとしてではなく、製品内で実行されます。

機能	「テキストとしてのAI」の時代	「実行としてのAI」の時代 (Copilot SDK)
インタラクション	テキスト入力、テキスト出力	プログラム可能な実行ループ
ワークフロー	手動での決定、脆いスクリプト	適応型、自己修正エージェント
コンテキスト	プロンプトに埋め込まれることが多い（脆い）	MCPを介して構造化、リアルタイム取得
統合	孤立した交換、IDE中心	どこにでも組み込み可能（アプリ、サービス、SaaS）
開発者の役割	プロンプトエンジニアリング、手動オーケストレーション	意図、制約、ツールの定義
コア原則	AIが助言し、人間が実行する	AIが計画・実行し、人間が監督する

なぜこれが重要なのか：AI実行がアプリケーションに直接組み込まれると、それは単なる便利な相棒ではなく、基本的なインフラストラクチャになります。ソフトウェアが実行される場所であればどこでも利用可能になり、AIの力をデジタル操作のあらゆる隅々にまで広げ、真にインテリジェントで適応性の高いソフトウェア環境を育みます。

アーキテクチャの転換：プログラム可能なAIとその未来

「テキストとしてのAI」から「実行としてのAI」への移行は、重要なアーキテクチャの進化を表しています。これは、AIエージェントが単にスニペットを生成するだけでなく、定義された制約の下で動作し、実際のシステムとシームレスに統合し、ランタイムにインテリジェントに適応できる、プログラム可能なプランニングおよび実行ループであるパラダイムを意味します。

GitHub Copilot SDKは、この未来を実現するための主要なイネーブラーです。これらの洗練された実行機能をプログラム可能なレイヤーとしてアクセス可能にすることで、開発チームは、AIオーケストレーションの基本的な「方法」を絶えず再構築するのではなく、ソフトウェアが達成すべきより高レベルの「内容」に集中できるようになります。この変化は、AIを単なる新しいユーティリティから、現代のソフトウェアアーキテクチャの中心的で不可欠なコンポーネントへと変革し、全体としてより回復力があり、自律的で、インテリジェントなアプリケーションを約束します。アプリケーションがロジックをトリガーできるなら、今やエージェント実行をトリガーでき、真にスマートなソフトウェアの新時代が幕を開けます。

元の情報源

https://github.blog/ai-and-ml/github-copilot/the-era-of-ai-as-text-is-over-execution-is-the-new-interface/

よくある質問

What is the core shift from 'AI as text' to 'AI as execution' introduced by the GitHub Copilot SDK?

The fundamental shift signifies a move from AI systems that merely generate text output from text input, requiring manual human intervention for the next steps, to systems where AI can actively plan, execute, adapt, and recover from errors within a predefined set of constraints. This means AI transitions from a passive assistant to an active participant, capable of orchestrating complex, multi-step operations directly within software applications, making it a functional component rather than just a conversational interface. The Copilot SDK provides the tools to embed this execution layer into any application.

How does the GitHub Copilot SDK enable sophisticated agentic workflows within applications?

The GitHub Copilot SDK empowers applications by providing access to the same production-tested planning and execution engine that drives GitHub Copilot CLI. Instead of building complex orchestration stacks from scratch, developers can embed this SDK to delegate intent to AI agents. These agents can explore repositories, plan necessary steps, modify files, run commands, and adapt to unforeseen issues—all while respecting defined boundaries. This allows software to become more adaptive and resilient, moving beyond rigid, scripted workflows to dynamic, context-aware operations.

What is the Model Context Protocol (MCP) and why is it crucial for grounded AI execution?

The Model Context Protocol (MCP) is a vital component that enables structured and composable context for AI agents. Rather than embedding critical system logic and data within prompts—a practice that leads to brittle, hard-to-test workflows—MCP allows applications to define domain-specific tools and agent skills. The execution engine then uses MCP to retrieve relevant context directly at runtime, such as service ownership data, API schemas, or dependency rules. This ensures that AI agents operate on real, permissioned data and systems, preventing guesswork and making AI workflows more reliable and maintainable.

Beyond the Integrated Development Environment (IDE), where can the GitHub Copilot SDK embed AI execution?

The GitHub Copilot SDK liberates AI execution from being confined primarily to the IDE, allowing it to function as a pervasive application-layer capability. This means agentic capabilities can be seamlessly integrated into a wide array of environments, including desktop applications, internal operational tools, background services, SaaS platforms, and event-driven systems. By enabling applications to programmatically invoke Copilot upon specific events—like a file change, deployment trigger, or user action—the SDK transforms AI from a mere helper in a side window into core infrastructure that operates wherever the software runs.

What are the primary benefits of delegating multi-step tasks to AI agents using the Copilot SDK?

Delegating multi-step tasks to AI agents via the Copilot SDK offers significant advantages over traditional scripting. It allows software to handle workflows that are context-dependent, change dynamically mid-run, or require robust error recovery, which typically break down fixed scripts. By delegating 'intent' rather than explicit steps, agents can autonomously explore, plan, execute, and adapt within defined constraints. This leads to more scalable, adaptable, and observable systems, freeing developers from continually rebuilding bespoke orchestration layers for complex, evolving processes.

How does the Copilot SDK improve the reliability and adaptability of AI-powered systems?

The Copilot SDK enhances reliability and adaptability by providing a robust execution layer and integrating structured context. Its production-tested planning and execution engine ensures agents can plan complex operations, execute commands, modify files, and recover from errors, making systems more resilient. Furthermore, by utilizing the Model Context Protocol (MCP), agents access real-time, structured, and permissioned context—like API schemas or dependency graphs—rather than relying on potentially outdated or generalized prompt information. This grounding in real data ensures agents make informed decisions, reducing errors and increasing the system's ability to adapt to changing conditions and constraints.

Is the GitHub Copilot SDK primarily for professional developers, or can others benefit from its capabilities?

While the GitHub Copilot SDK is designed to empower professional developers by extending agentic AI capabilities into their applications and infrastructure, its benefits ripple outwards. By enabling AI to handle complex, multi-step tasks and integrate directly into various software systems, it streamlines workflows, reduces manual effort, and enhances the adaptability of applications. This ultimately benefits end-users and organizations by leading to more efficient, intelligent, and robust software, even if the direct interaction with the SDK is primarily on the developer's side. The SDK makes AI a fundamental infrastructure component across the software ecosystem.