文書のパッセージの再ランク付け

最終更新: 2025年2月07日

文書のパッセージの再ランク付け

watsonx.aiから利用可能なリランカーモデルとテキストリランカーAPIを使用して、指定されたクエリに答える可能性が高いものから低いものへと、一連の文章をランク付けする。

開発方法

これらのプログラミング方法を使用して、文書からテキストを抽出することができます

REST API
Python
Node.js

概要

クエリに関連性の高い順に一連の文章をランク付けすることは、検索や検索結果の取得、チャットワークフローに役立ちます。特に、事実に基づく回答が重要な場面で有効です。

サポートされている基盤モデル

watsonx.ai で利用可能なリランカーモデルの詳細については、「サポートされているエンコーダモデル」を参照してください。

使用可能なリランカーモデルのリストを取得するには、watsonx.aias a service API のList the available foundation modelsメソッドを使用します。利用可能なリランカー・モデルのみを返すには、'filters=function_rerankパラメータを指定する。

例：

curl -X GET \
  'https://{region}.ml.cloud.ibm.com/ml/v1/foundation_model_specs?version=2024-10-18&filters=function_rerank'

注：埋め込みモデルの多くは、リランク機能をサポートしていることも示している。しかし、埋め込みモデルは、精度の低いランキング方法であるセマンティック・リランキングをサポートしている。エンベッディングモデルがパッセージを再ランク付けする際には、エンベッディングベクター値から得られる類似度スコアに基づいてランク付けを行う。クロスエンコーダーモデルは、各パッセージをクエリと明示的に比較し、ペアリングごとにランキングスコアを生成するため、リランキングにより効果的である。

Python

watsonx.ai Python ライブラリのリランカークラスを参照してください。

REST API

次の図に示されているように、 watsonx.ai APIのリランカーメソッドは、クエリと文書の抜粋のリストを含むREST APIリクエストを受け入れます。 APIはこれらのテキスト文字列を再ランク付けモデルに送信します。再ランク付けモデルでは、クエリと各パッセージを順番に組み合わせて、テキストを埋め込みベクトルに変換し、各ペアのベクトルを比較して、それらの類似度をスコア化します。次に、モデルは生成された類似スコアに基づいてパッセージを再ランク付けします。このモデルはテキストをテキスト埋め込みに変換しますが、埋め込みは返されません。テキスト埋め込みは類似スコアを生成するために使用され、スコアのみが返されます。

リランカーAPIリクエストの流れを示す図

API リファレンス

詳細はAPIリファレンス・ドキュメントを参照。

REST API の例

この例のコードでは、「ms-marco-minilm-l-12-v2モデルを使用して、クエリに答える可能性が最も高い情報を持つ文章が最初にリストされるように、次の文章を再ランク付けしている。

照会

エージェント主導のAIワークフローとは？

パッセージ

入力パッセージはwatsonx.aiの用語集にある定義である：

項目 0. foundation modelとは、幅広い下流タスクに適応できる大規模な生成AIモデルである。
項目1. 生成AIは、テキスト、ソースコード、画像、音声、合成データなど、さまざまな種類のコンテンツを生成できるAIアルゴリズムの一種である。
項目2. エージェントAIは、プロンプトを複数のタスクに分解し、適切なAIエージェントにタスクを割り当て、人間の介入なしに答えを合成することができる生成的AIフローである。
項目3. AI倫理は、リスクや有害な結果を減らしつつ、AIの有益な影響を最適化する方法を研究する学際的な分野である。 AI倫理の問題の例としては、データの責任とプライバシー、公平性、説明可能性、堅牢性、透明性、環境の持続可能性、包括性、道徳的主体性、価値観の調整、説明責任、信頼、技術の誤用などがある。
項目4. AIガバナンスとは、組織の指示、スタッフ、プロセス、システムを通じて、AIのライフサイクルを通じて指示、評価、監視、是正措置を講じ、AIシステムが組織の意図通り、利害関係者の期待通り、関連する規制の要求通りに運用されていることを保証するための、組織の統治行為である。

REST API リクエストの例

この例では、5つのパッセージのみが再ランク付けのために提出されている。最大1,000入力まで指定可能。しかし、クロスエンコーダー・モデルは、各パッセージをクエリーと一緒に次々に処理するため、指定するパッセージが多ければ多いほど、リランキング処理に時間がかかる。

あなたが提出する各入力は、リランカー・モデルによって定義された最大入力トークン制限に適合していなければなりません。

この例では、モデルが許容する以上のトークンがある場合に対応するため、行を強制的に切り捨てる「truncate_input_tokensパラメータを指定しています。さもなければ、リクエストは失敗するかもしれない。

この例では、元のパッセージのテキストがレスポンスに含まれるように、'inputsリターン・オプション・パラメータが含まれています。実際のワークフローでは、「inputsパラメータを含める必要はないかもしれない。

テキスト・リランカー・メソッドのペイロードの「model_id」に、使用したいリランカー・モデルを指定する。

以下の例では、{url}variable with the right value for your instance, such asus-south.ml.cloud.ibm.comを置き換えます。独自のベアラートークンとプロジェクトIDを追加します。

curl -X POST \
  'https://{url}/ml/v1/text/rerank?version=2024-10-17' \
  --header 'Accept: application/json' \
  --header 'Content-Type: application/json' \
  --header 'Authorization: Bearer eyJraWQiOi...' \
  --data '{
      "inputs": [
        {
          "text": "A foundation model is a large-scale generative AI model that can be adapted to a wide range of downstream tasks."
        },
        {
          "text": "Generative AI is a class of AI algorithms that can produce various types of content including text, source code, imagery, audio, and synthetic data."
        },
        {
          "text": "Agentic AI is a generative AI flow that can decompose a prompt into multiple tasks, assign tasks to appropriate gen AI agents, and synthesize an answer without human intervention."
        },
        {
          "text": "AI ethics is a multidisciplinary field that studies how to optimize AI'\''s beneficial impact while reducing risks and adverse outcomes. Examples of AI ethics issues are data responsibility and privacy, fairness, explainability, robustness, transparency, environmental sustainability, inclusion, moral agency, value alignment, accountability, trust, and technology misuse."
        },
        {
          "text": "AI governance is an organization'\''s act of governing, through its corporate instructions, staff, processes and systems to direct, evaluate, monitor, and take corrective action throughout the AI lifecycle, to provide assurance that the AI system is operating as the organization intends, as its stakeholders expect, and as required by relevant regulation."
        }
      ],
      "query": "What is an agent-driven AI workflow?",
      "parameters":{
        "truncate_input_tokens": 512,
        "return_options":{
          "inputs":true
        }
      },
      "model_id": "cross-encoder/ms-marco-minilm-l-12-v2",
      "project_id": "51f3a990-4372-4ac3-9ddb-ed99d9b50840"
    }'

最高順位のパッセージだけを返したい場合は、「top_nパラメータを指定します。次のサンプルは、クエリに最も関連する1つのパッセージのみを返すようにモデルを指定する方法を示しています：

"parameters":{
    "truncate_input_tokens": 512,
    "return_options":{
      "inputs":true,
      "top_n": 1
    }
}

REST API レスポンスの例

リクエスト例に対して、以下の応答が返される。いくつか気づいたことがある：

パッセージは、高得点のパッセージを最初に表示するように並び替えられる。

このモデルは、'エージェントAI定義をリストの一番上に正確に返す。それは、それがエージェント駆動型ワークフローに関する質問に最も密接に関連しているからである。
0から4までのインデックス値は、リクエストで送信された'inputs配列の中のパッセージの元の位置を表す。

例えば、Agentic AIに関する文章はリストの3番目の文章なので、インデックス値は「2である。
input_token_countフィールドは、リクエストで処理されたトークンの総数を示す。

トークンのカウント方法についての詳細は、課金情報をご覧ください。

{
  "model_id": "cross-encoder/ms-marco-minilm-l-12-v2",
  "created_at": "2024-10-17T14:01:47.322Z",
  "results": [
    {
      "index": 2,
      "score": 5.063366413116455,
      "input": {
        "text": "Agentic AI is a generative AI flow that can decompose a prompt into multiple tasks, assign tasks to appropriate gen AI agents, and synthesize an answer without human intervention."
      }
    },
    {
      "index": 4,
      "score": -0.992393970489502,
      "input": {
        "text": "AI governance is an organization's act of governing, through its corporate instructions, staff, processes and systems to direct, evaluate, monitor, and take corrective action throughout the AI lifecycle, to provide assurance that the AI system is operating as the organization intends, as its stakeholders expect, and as required by relevant regulation."
      }
    },
    {
      "index": 1,
      "score": -3.5372314453125,
      "input": {
        "text": "Generative AI is a class of AI algorithms that can produce various types of content including text, source code, imagery, audio, and synthetic data."
      }
    },
    {
      "index": 0,
      "score": -4.646212100982666,
      "input": {
        "text": "A foundation model is a large-scale generative AI model that can be adapted to a wide range of downstream tasks."
      }
    },
    {
      "index": 3,
      "score": -4.926990032196045,
      "input": {
        "text": "AI ethics is a multidisciplinary field that studies how to optimize AI's beneficial impact while reducing risks and adverse outcomes. Examples of AI ethics issues are data responsibility and privacy, fairness, explainability, robustness, transparency, environmental sustainability, inclusion, moral agency, value alignment, accountability, trust, and technology misuse."
      }
    }
  ],
  "input_token_count": 292,
  "system": {
    "warnings": [
      {
        "message": "This model is a Non-IBM Product governed by a third-party license that may impose use restrictions and other obligations. By using this model you agree to its terms as identified in the following URL.",
        "id": "disclaimer_warning",
        "more_info": "https://dataplatform.cloud.ibm.com/docs/content/wsj/analyze-data/fm-models.html?context=wx"
      }
    ]
  }
}

先ほどのサンプルのように、パラメーター・オブジェクトに「"top_n": 1含めると、レスポンスは次のようになる：

{
  "model_id": "cross-encoder/ms-marco-minilm-l-12-v2",
  "created_at": "2024-10-17T20:18:50.867Z",
  "results": [
    {
      "index": 2,
      "score": 5.063366413116455,
      "input": {
        "text": "Agentic AI is a generative AI flow that can decompose a prompt into multiple tasks, assign tasks to appropriate gen AI agents, and synthesize an answer without human intervention."
      }
    }
  ],
  "input_token_count": 292,
  "system": {
    "warnings": [
      {
        "message": "This model is a Non-IBM Product governed by a third-party license that may impose use restrictions and other obligations. By using this model you agree to its terms as identified in the following URL.",
        "id": "disclaimer_warning",
        "more_info": "https://dataplatform.cloud.ibm.com/docs/content/wsj/analyze-data/fm-models.html?context=wx"
      }
    ]
  }
}

請求先情報

課金は、使用しているリランカー・モデルによって処理されたトークン数に基づいて行われます。単語ごとに使用されるトークンの数はモデルによって異なる。

トークンの詳細については、トークンとトークン化を参照してください。
モデルごとに許可されるトークンの最大数の詳細については、サポートされるエンコーダモデルを参照してください。
価格についての詳細は、ジェネレーティブAIアセットの課金詳細をご覧ください。

クロスエンコーダー・モデルは、クエリと各入力パッセージを一緒に処理して、各入力に対するスコアを生成することを覚えておいてください。したがって、トークンを合計するには、次のような数式を使わなければならない：

(クエリ内のトークン x 入力数) + (すべての入力内のトークンの合計 x クエリ数)

注:

APIは1つのクエリーしか受け付けない。したがって、「Number of queries常に1に等しい。

提出されたテキストに加えて、以下のマーカーがモデルによって使用される。各マーカーはトークンとしてカウントされる：

CLS:分類を意味し、モデルに提出されるシーケンスの先頭に追加され、シーケンスの開始を識別する。
SEP:セパレーターを意味し、あるシーケンスの終わりと次のシーケンスの始まりを識別するために使われる。

各入力には、クエリー＋入力シーケンスの開始を示す1つの「CLSマーカーと、2つの「SEPセパレーターマーカーが含まれる。したがって、（3×入力数）を先ほど導き出した合計に加え、合計を求める。

詳細情報

親トピック 生成的AIソリューションのコーディング

トピックは役に立ちましたか?

0/1000

開発方法Copy link to section

概要Copy link to section

サポートされている基盤モデルCopy link to section

PythonCopy link to section

REST APICopy link to section

API リファレンスCopy link to section

REST API の例Copy link to section

REST API リクエストの例Copy link to section

REST API レスポンスの例Copy link to section

請求先情報Copy link to section

詳細情報Copy link to section