기초 모델 정보 얻기

마지막 업데이트 날짜: 2025년 4월 23일

기초 모델 정보 얻기

watsonx.ai 에 배포된 기초 모델 목록을 가져와서 유용한 방법으로 목록을 필터링하십시오.

발전하는 방법

다음과 같은 프로그래밍 방법을 사용하면 사용 가능한 기초 모델에 대한 정보를 얻을 수 있습니다

또는, 기초 모델 목록을 보고, watsonx.ai 의 리소스 허브에서 필터링할 수 있습니다. 자세한 내용은 다음 자료를 참고하세요

휴식 (REST API)

watsonx.ai 의 List the available foundation models 메소드를 사용하면 사용 가능한 기초 모델에 대한 정보를 얻을 수 있습니다.

반환되는 모델 정보에는 코드에서 모델을 참조하는 데 필요한 모델 ID가 포함됩니다.

사용 가능한 기초 모델 나열

watsonx.ai API의 사용 가능한 기초 모델 나열 메서드는 watsonx.ai 에서 IBM 에서 제공하는 기초 모델에 대한 정보를 가져와서 즉시 추론할 수 있습니다.

curl -X GET \
  'https://{region}.ml.cloud.ibm.com/ml/v1/foundation_model_specs?version=2024-05-01'

모델 ID를 얻으면 다음과 같이 코드에서 모델 ID를 참조할 수 있습니다

curl --request POST 'https://{cluster_url}/ml/v1/text/generation?version=2023-05-02'
-H 'Authorization: Bearer eyJhbGciOiJSUzUxM...'
-H 'Content-Type: application/json'
-H 'Accept: application/json'
--data-raw '{
  "model_id": "google/flan-t5-xxl",
  "input": "Tell me a story",
  "project_id": "63dc4cf1-252f-424b-b52d-5cdd9814987f"
}'

맞춤형 기초 모델 나열

접근할 수 있는 배포된 커스텀 파운데이션 모델 목록을 얻으려면 다음 방법을 사용하십시오. 이 방법은 무기명 토큰이 필요합니다.

curl -X GET \
  'https://{region}.ml.cloud.ibm.com/ml/v4/deployments?version=2024-12-12&type=custom_foundation_model'

디플로이 온 디맨드 모델 목록

IBM 에서 제공하는 기초 모델 목록을 직접 배포하려면 다음 방법을 사용하십시오

curl -X GET \
  'https://{region}.ml.cloud.ibm.com/ml/v1/foundation_model_specs?version=2024-12-10&filters=curated'

반환된 모델 ID를 사용하여 배포 온 디맨드 기반 모델을 배포 공간에 배포합니다. 모델을 배포한 후에는 배포에 사용된 API 엔드포인트를 사용하여 모델을 추론할 수 있습니다.

제공된 기초 모델의 추론에 필요한 모델 ID 찾기

제공된 기초 모델의 모델 ID를 찾으려면 다음 링크를 클릭하십시오

온디맨드 파운데이션 모델 배포를 위한 모델 ID

주의: 모델 ID로 주문형 기반 모델을 추론하지 마세요. 모델 ID는 모델 배포에만 사용합니다. 모델을 배포한 후에는 배포에 사용된 API 엔드포인트를 사용하여 모델을 추론할 수 있습니다.

watsonx.ai 의 REST API를 사용하면 배포 방법 생성을 통해 기초 모델을 배포한 다음, 배포 > 텍스트 추론 방법을 사용하여 배포된 기초 모델을 추론할 수 있습니다.

자세한 내용은 REST API를 사용하여 온디맨드로 배포된 기초 모델 배포 및 관리하기를 참조하세요.

다음 목록은 API에서 온디맨드 기반 모델을 배포할 때 ' {model_id} ' 매개변수에 사용할 수 있는 값을 보여줍니다.

granite-3-1-8b-base

ibm/granite-3-1-8b-base-curated

granite-3-3-8b-instruct-curated

ibm/granite-3-3-2b-instruct-curated

granite-3-3-2b-instruct-curated

ibm/granite-3-3-2b-instruct-curated

granite-3-2-8b-instruct

ibm/granite-3-2-8b-instruct-curated

granite-3-8b-base

ibm/granite-3-8b-base-curated

granite-7b-lab

ibm/granite-7b-lab-curated

granite-8b-japanese

ibm/granite-8b-japanese-curated

granite-20b-multilingual

ibm/granite-20b-multilingual-curated

granite-13b-chat-v2

ibm/granite-13b-chat-v2-curated

granite-13b-instruct-v2

ibm/granite-13b-instruct-v2-curated

granite-20b-code-base-schema-linking

ibm/granite-20b-code-base-schema-linking-curated

granite-20b-code-base-sql-gen

ibm/granite-20b-code-base-sql-gen-curated

allam-1-13b-instruct

ibm/allam-1-13b-instruct-curated

codellama-34b-instruct-hf

meta-llama/codellama-34b-instruct-hf-curated

deepseek-r1-distill-llama-8b

deepseek-ai/deepseek-r1-distill-llama-8b-curated

deepseek-r1-distill-llama-70b

deepseek-ai/deepseek-r1-distill-llama-70b-curated

eurollm-1-7b-instruct

utter-project/eurollm-1-7b-instruct-curated

eurollm-9b-instruct

utter-project/eurollm-9b-instruct-curated

flan-t5-xl-3b

google/flan-t5-xl-curated

flan-t5-xxl-11b

google/flan-t5-xxl-curated

flan-ul2-20b

google/flan-ul2-curated

llama-2-13b-chat

meta-llama/llama-2-13b-chat-curated

llama-2-70b-chat

meta-llama/llama-2-70b-chat-curated

llama-3-8b-instruct

meta-llama/llama-3-8b-instruct-curated

llama-3-70b-instruct

meta-llama/llama-3-70b-instruct-curated

llama-3-1-8b

meta-llama/llama-3-1-8b-curated

llama-3-1-8b-instruct

meta-llama/llama-3-1-8b-instruct-curated

llama-3-2-11b-vision-instruct

meta-llama/llama-3-2-11b-vision-instruct-curated

llama-3-3-70b-instruct

meta-llama/llama-3-3-70b-instruct-curated

llama-3-3-70b-instruct-hf

meta-llama/llama-3-3-70b-instruct-hf-curated

mixtral-8x7b-base

mistralai/mixtral-8x7b-base-curated

mixtral-8x7b-instruct-v01

mistralai/mixtral-8x7b-instruct-v01-curated

mistral-nemo-instruct-2407

mistralai/mistral-nemo-instruct-2407-curated

mt0-xxl-13b

bigscience/mt0-xxl-curated

poro-34b-chat

lumiopen/poro-34b-chat-curated

상위 주제: 생성 AI 솔루션 코딩

주제가 도움이 되었습니까?

0/1000

발전하는 방법Copy link to section

휴식 (REST API)Copy link to section

사용 가능한 기초 모델 나열Copy link to section

맞춤형 기초 모델 나열Copy link to section

디플로이 온 디맨드 모델 목록Copy link to section

제공된 기초 모델의 추론에 필요한 모델 ID 찾기Copy link to section

온디맨드 파운데이션 모델 배포를 위한 모델 IDCopy link to section