AWS What's New (machine-translated version)

Introducing the Instance Topology API for ML and HPC workloads

Posted On: Nov 14, 2023

AWS announces the general availability of the Amazon Elastic Compute Cloud (EC2) Instance Topology API for machine learning and high performance computing workloads. The Instance Topology API gives customers a hierarchical view, unique to each account, of the relative proximity between instances. Customers can describe their instance topology to identify instances that are in a tightly coupled group, and use that information to reduce communication time and, in turn, job completion time.

Customers running distributed parallel workloads, such as large language model training and computational fluid dynamics, are scaling those workloads to thousands of EC2 instances. With the EC2 Instance Topology API, customers can describe topology as a network node set and filter by Availability Zone, group name, instance type, and instance ID. The network node set represents the top-down relationship of instances to one another within a Region. Customers can ingest this topology into their scheduler of choice and use it to allocate instances to jobs on a best-fit basis.
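As a rough illustration of how a scheduler might consume this topology, the Python sketch below groups instances by shared network node and picks the tightest group that can hold a job. It is a sketch under stated assumptions, not a reference implementation: the sample instance IDs and node names are made up, `group_by_node` and `best_fit` are hypothetical helpers, and the code assumes each instance's `NetworkNodes` list is ordered top-down with the last entry closest to the instance. `fetch_topology` uses the boto3 `describe_instance_topology` call with an `instance-type` filter and needs AWS credentials; check parameter and filter names against the current EC2 API reference before relying on them.

```python
from collections import defaultdict

# Hypothetical sample shaped like the "Instances" list in a
# DescribeInstanceTopology response; IDs and node names are made up.
SAMPLE_INSTANCES = [
    {"InstanceId": "i-aaa", "NetworkNodes": ["nn-1", "nn-11", "nn-111"]},
    {"InstanceId": "i-bbb", "NetworkNodes": ["nn-1", "nn-11", "nn-111"]},
    {"InstanceId": "i-ccc", "NetworkNodes": ["nn-1", "nn-11", "nn-112"]},
    {"InstanceId": "i-ddd", "NetworkNodes": ["nn-1", "nn-12", "nn-121"]},
    {"InstanceId": "i-eee", "NetworkNodes": ["nn-1", "nn-11", "nn-112"]},
]


def fetch_topology(region, instance_type):
    """Page through DescribeInstanceTopology with boto3 (needs AWS credentials)."""
    import boto3  # imported lazily so the offline helpers below run without it

    ec2 = boto3.client("ec2", region_name=region)
    instances, token = [], None
    while True:
        kwargs = {"Filters": [{"Name": "instance-type", "Values": [instance_type]}]}
        if token:
            kwargs["NextToken"] = token
        page = ec2.describe_instance_topology(**kwargs)
        instances.extend(page.get("Instances", []))
        token = page.get("NextToken")
        if not token:
            return instances


def group_by_node(instances, depth):
    """Map the network node at `depth` (0 = top layer) to the instance IDs under it."""
    groups = defaultdict(list)
    for inst in instances:
        groups[inst["NetworkNodes"][depth]].append(inst["InstanceId"])
    return dict(groups)


def best_fit(instances, needed):
    """Pick `needed` instance IDs from the smallest group at the deepest layer that fits."""
    if not instances:
        return None
    layers = len(instances[0]["NetworkNodes"])
    for depth in range(layers - 1, -1, -1):  # deepest (closest) layer first
        candidates = [
            ids for ids in group_by_node(instances, depth).values()
            if len(ids) >= needed
        ]
        if candidates:
            return sorted(min(candidates, key=len))[:needed]
    return None  # no single network node covers enough instances


if __name__ == "__main__":
    print(best_fit(SAMPLE_INSTANCES, 2))
    print(best_fit(SAMPLE_INSTANCES, 3))
```

With the sample data, a two-instance job fits under a single bottom-layer node, while a three-instance job has to move up one layer. A production scheduler would also track which instances are already busy, which this sketch ignores.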

The EC2 Instance Topology API is now available in the following AWS Regions: US East (N. Virginia), US East (Ohio), US West (Oregon), Asia Pacific (Seoul), Asia Pacific (Tokyo), Canada (Central), Europe (Frankfurt), Europe (Ireland), and Europe (Stockholm). It is available on the following instance platforms: HPC6id, HPC6a, HPC7a, HPC7g, P3dn, P4d, P4de, P5, TRN1, TRN1n.

To learn more, see the latest EC2 User Guide.

Introducing the Instance Topology API for ML and HPC workloads

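AWS has announced the general availability of the Amazon Elastic Compute Cloud (EC2) Instance Topology API for machine learning and high performance computing workloads. The Instance Topology API gives customers a hierarchical view, unique to each account, of the relative proximity between instances. Customers can describe their instance topology to identify instances that belong to a tightly coupled group, and use this to shorten communication time and, in turn, job completion time.

Customers running distributed parallel workloads, such as large language model training and computational fluid dynamics, are scaling those workloads to thousands of EC2 instances. With the EC2 Instance Topology API, customers can describe topology as a network node set and filter by Availability Zone, group name, instance type, and instance ID. The network node set represents the top-down relationship of instances to one another within a Region. Customers can ingest this topology into any scheduler and use it to allocate instances to jobs on a best-fit basis.

The EC2 Instance Topology API is now available in the following AWS Regions: US East (N. Virginia), US East (Ohio), US West (Oregon), Asia Pacific (Seoul), Asia Pacific (Tokyo), Canada (Central), Europe (Frankfurt), Europe (Ireland), and Europe (Stockholm). It is available on the following instance platforms: HPC6id, HPC6a, HPC7a, HPC7g, P3dn, P4d, P4de, P5, TRN1, TRN1n.

To learn more, see the latest EC2 User Guide.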