Commit Graph

83 Commits

Author SHA1 Message Date
Aiden Dai
dc067affc0 Use yaml template 2024-12-16 16:33:37 +08:00
Aiden Dai
29621ae59c Automatically detect model list 2024-12-16 16:15:09 +08:00
Aiden Dai
d4938a0af2 Automatically detect model list 2024-12-16 16:01:59 +08:00
Attila Szucs
cb38d328aa Add environment variable for PORT (#47)
* Customizable port

* Fix CMD
2024-12-16 10:00:17 +08:00
Fabio Nonato
4fc0d3bc94 Image error fix (#80)
---------

Co-authored-by: Fabio Nonato <fnp@amazon.com>
2024-12-11 11:26:51 +08:00
Hans Knecht
241d5c0f3e feat: allow the use of an ENV variable to set the API key if the ParameterStore isn't used. (#40) 2024-12-06 14:32:06 +08:00
Fabian Fischer
25b3cfb146 feat: add amazon nova inference profiles in us (#79) 2024-12-06 13:52:50 +08:00
mschfh
17503b032a Add cross-region inference profiles for Llama 3.2 models. (#75) 2024-12-05 11:22:11 +08:00
bkocik
6849ca828a Add cross-region inference profiles for Llama 3.1 models. (#72) 2024-11-20 09:57:35 +08:00
KAEYL98
11a31b5584 feat: add support for APAC claude 3 profiles (#69) 2024-11-07 16:43:15 +08:00
heisenbergye
5f7676608a suppot all Claude models Cross-Region Inference (#65) 2024-10-29 14:43:31 +08:00
Meng Xin Zhu
9cc3ea8253 chore: publish templates to s3 in release workflow (#64) 2024-10-28 17:36:35 +08:00
Aaron Yi
8785c63ddf fix: remove the code review pipeline
until the access right can be grant to pull request from fork
2024-10-25 13:12:59 +08:00
yike5460
0afd0463e1 fix: add debugging info onto workflow 2024-10-25 02:33:26 +00:00
Sergei Mikhailov
3a97677b97 Added "new Claude 3.5 Sonnet" v2 model to the list (#60) 2024-10-23 14:54:45 +08:00
yike5460
728ef6d8a6 fix: update workflow action to user var instead of secret 2024-10-10 06:24:04 +00:00
Mengxin Zhu
46fb759137 chore: use correct Dockerfile for building lambda image 2024-10-09 23:39:37 +08:00
Mengxin Zhu
326e566105 chore: use arm64 architecture image for lambda 2024-10-09 23:15:10 +08:00
Meng Xin Zhu
c1ee1b4244 chore: add automation script to release images (#58) 2024-10-09 18:20:14 +08:00
yike5460
552578a0ee fix: fix action dep issue 2024-10-09 08:30:19 +00:00
yike5460
d9590d6504 fix: place action file into the right folder 2024-10-09 08:22:14 +00:00
yike5460
29d333d367 feat: enable code review and pr description in workflow 2024-10-09 08:13:02 +00:00
Yuki Sekiya
c655f50616 feat: Handle multiple user messages in a single request (#26) 2024-10-09 15:13:58 +08:00
Aiden Dai
97d77ab0c5 Merge pull request #39 from diopres/patch-2
feat: add support for Mistral Large 2 (24.07)
2024-08-14 14:42:25 +08:00
diopres
db0817392f feat: add support for Mistral Large 2 (24.07)
added support for Mistral Large 2 (24.07)
2024-08-12 19:06:44 +05:30
Aiden Dai
2950c15ecb Fix empty response bug 2024-08-09 17:28:09 +08:00
Aiden Dai
f8faf32a76 Add Llama 3.1 without tool call 2024-07-30 12:27:08 +08:00
Aiden Dai
f6b73152bc Update boto3 version 2024-06-25 17:29:47 +08:00
Aiden Dai
66bdfdf5c1 Support Claude 3.5 Sonnet 2024-06-21 10:28:04 +08:00
Aiden Dai
49dd6608a0 Support of tool choice 2024-06-21 10:24:11 +08:00
Aiden Dai
b3509ee0f0 Support multiple tool calls 2024-06-11 16:58:26 +08:00
Aiden Dai
56786f9e32 Update api response 2024-06-11 10:53:56 +08:00
Aiden Dai
6ef7641a0d Update api response 2024-06-07 10:58:44 +08:00
Aiden Dai
5f84cef13a Refactor to use new Converse API 2024-06-04 17:01:06 +08:00
Aiden Dai
f0ea117732 Refactor to use new Converse API 2024-06-04 16:20:25 +08:00
Aiden Dai
696039053d Refactor to use new Converse API 2024-06-04 14:59:40 +08:00
Aiden Dai
86e3db7e09 Merge pull request #20 from didier-durand/patch-2
Update Usage.md: fixing 1 typo + 1 spelling
2024-06-03 09:41:14 +08:00
Aiden Dai
c3b8f3d9e1 Merge pull request #19 from didier-durand/patch-1
Fixing 1 typo in README.md: spelling n section header
2024-06-03 09:41:00 +08:00
Didier Durand
aef0a56016 Update Usage.md: fixing 1 typo + 1 spelling
Update Usage.md: fixing 1 typo + 1 spelling
2024-05-31 18:23:20 +02:00
Didier Durand
ac87a3787d Fixing 1 typo in README.md 2024-05-31 18:16:53 +02:00
Aiden Dai
b141897da4 Merge pull request #16 from aws-samples/dependabot/pip/src/requests-2.32.0
Bump requests from 2.31.0 to 2.32.0 in /src
2024-05-21 16:24:49 +08:00
dependabot[bot]
c7276171ad ---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-05-21 08:06:48 +00:00
Aiden Dai
d7a26dcf8b Update README 2024-05-10 10:21:03 +08:00
Aiden Dai
9f6b334385 Refactor model implementation 2024-05-09 16:58:04 +08:00
Aiden Dai
180c199da9 Add base64 encoded embedding support 2024-04-26 13:46:46 +08:00
Aiden Dai
27d253fddb Add Llama 3 support 2024-04-25 10:04:55 +08:00
Aiden Dai
0512a9b8cc Add Llama 3 support 2024-04-23 10:30:57 +08:00
Aiden Dai
7416f9a4e2 Clean up code 2024-04-18 16:22:06 +08:00
Aiden Dai
8340be4660 Merge pull request #4 from greenjerry/add_claude3_opus
Add support of anthropic.claude-3-opus-20240229-v1:0 model
2024-04-18 11:22:51 +08:00
yhx
7df5617037 Add support of anthropic.claude-3-opus-20240229-v1:0 model 2024-04-17 14:22:46 +08:00