Add cross-region inference profiles for Llama 3.1 models. (#72)

This commit is contained in:
bkocik
2024-11-19 20:57:35 -05:00
committed by GitHub
parent 11a31b5584
commit 6849ca828a
3 changed files with 32 additions and 14 deletions

View File

@@ -173,6 +173,8 @@ Currently, Bedrock Access Gateway only supports cross-region Inference for the f
- Claude 3 Opus - Claude 3 Opus
- Claude 3 Sonnet - Claude 3 Sonnet
- Claude 3.5 Sonnet - Claude 3.5 Sonnet
- Meta Llama 3.1 8b Instruct
- Meta Llama 3.1 70b Instruct
**Prerequisites:** **Prerequisites:**
- IAM policies must allow cross-region access,Callers need permissions to access models and inference profiles in both regions (added in cloudformation template) - IAM policies must allow cross-region access,Callers need permissions to access models and inference profiles in both regions (added in cloudformation template)

View File

@@ -175,6 +175,8 @@ Cross-Region Inference 支持跨区域访问的基础模型,即允许用户在
- Claude 3 Opus - Claude 3 Opus
- Claude 3 Sonnet - Claude 3 Sonnet
- Claude 3.5 Sonnet - Claude 3.5 Sonnet
- Meta Llama 3.1 8b Instruct
- Meta Llama 3.1 70b Instruct
**使用前提:** **使用前提:**
- IAM Policy 有 inference profiles 相关的权限和调用模型的权限 (cloudformation template 中已添加) - IAM Policy 有 inference profiles 相关的权限和调用模型的权限 (cloudformation template 中已添加)

View File

@@ -139,12 +139,26 @@ class BedrockModel(BaseChatModel):
"tool_call": False, "tool_call": False,
"stream_tool_call": False, "stream_tool_call": False,
}, },
# Llama 3.1 8b cross-region inference profile
"us.meta.llama3-1-8b-instruct-v1:0": {
"system": True,
"multimodal": False,
"tool_call": False,
"stream_tool_call": False,
},
"meta.llama3-1-8b-instruct-v1:0": { "meta.llama3-1-8b-instruct-v1:0": {
"system": True, "system": True,
"multimodal": False, "multimodal": False,
"tool_call": False, "tool_call": False,
"stream_tool_call": False, "stream_tool_call": False,
}, },
# Llama 3.1 70b cross-region inference profile
"us.meta.llama3-1-70b-instruct-v1:0": {
"system": True,
"multimodal": False,
"tool_call": False,
"stream_tool_call": False,
},
"meta.llama3-1-70b-instruct-v1:0": { "meta.llama3-1-70b-instruct-v1:0": {
"system": True, "system": True,
"multimodal": False, "multimodal": False,