Commit Graph

144 Commits

Author SHA1 Message Date
Sean Smith
b26ee3e9ea Added troubleshooting guide and made buttons cool (#96)
Signed-off-by: Sean Smith <sean.smith@contextual.ai>
2025-02-11 12:40:27 +08:00
Aiden Dai
1cb8a6a603 Update readme 2025-02-10 15:48:34 +08:00
Aiden Dai
c39f6bc942 Use secrets manager for api key 2025-02-10 15:25:12 +08:00
Aiden Dai
74ca3b938e Update architecture diagram 2025-02-10 10:02:43 +08:00
Aiden Dai
a6f3e1176b fix secret access issue 2025-02-09 06:53:23 +08:00
Aiden Dai
4d88731233 Use secrets manager for api key 2025-02-08 21:36:59 +08:00
Sean Smith
48bf360456 Security Guide (#101)
Signed-off-by: Sean Smith <sean.smith@contextual.ai>
2025-02-08 11:40:24 +08:00
yytdfc
093c6fa586 add stop parameter (#86) 2024-12-31 11:15:24 +08:00
Aiden Dai
b2c187c716 Increase connect timeout 2024-12-19 16:45:18 +08:00
Aiden Dai
581638b794 Update docs 2024-12-17 17:38:21 +08:00
Aiden Dai
51bc727b38 Use readme 2024-12-16 17:11:54 +08:00
Aiden Dai
dc067affc0 Use yaml template 2024-12-16 16:33:37 +08:00
Aiden Dai
29621ae59c Automatically detect model list 2024-12-16 16:15:09 +08:00
Aiden Dai
d4938a0af2 Automatically detect model list 2024-12-16 16:01:59 +08:00
Attila Szucs
cb38d328aa Add environment variable for PORT (#47)
* Customizable port

* Fix CMD
2024-12-16 10:00:17 +08:00
Fabio Nonato
4fc0d3bc94 Image error fix (#80)
---------

Co-authored-by: Fabio Nonato <fnp@amazon.com>
2024-12-11 11:26:51 +08:00
Hans Knecht
241d5c0f3e feat: allow the use of an ENV variable to set the API key if the ParameterStore isn't used. (#40) 2024-12-06 14:32:06 +08:00
Fabian Fischer
25b3cfb146 feat: add amazon nova inference profiles in us (#79) 2024-12-06 13:52:50 +08:00
mschfh
17503b032a Add cross-region inference profiles for Llama 3.2 models. (#75) 2024-12-05 11:22:11 +08:00
bkocik
6849ca828a Add cross-region inference profiles for Llama 3.1 models. (#72) 2024-11-20 09:57:35 +08:00
KAEYL98
11a31b5584 feat: add support for APAC claude 3 profiles (#69) 2024-11-07 16:43:15 +08:00
heisenbergye
5f7676608a suppot all Claude models Cross-Region Inference (#65) 2024-10-29 14:43:31 +08:00
Meng Xin Zhu
9cc3ea8253 chore: publish templates to s3 in release workflow (#64) 2024-10-28 17:36:35 +08:00
Aaron Yi
8785c63ddf fix: remove the code review pipeline
until the access right can be grant to pull request from fork
2024-10-25 13:12:59 +08:00
yike5460
0afd0463e1 fix: add debugging info onto workflow 2024-10-25 02:33:26 +00:00
Sergei Mikhailov
3a97677b97 Added "new Claude 3.5 Sonnet" v2 model to the list (#60) 2024-10-23 14:54:45 +08:00
yike5460
728ef6d8a6 fix: update workflow action to user var instead of secret 2024-10-10 06:24:04 +00:00
Mengxin Zhu
46fb759137 chore: use correct Dockerfile for building lambda image 2024-10-09 23:39:37 +08:00
Mengxin Zhu
326e566105 chore: use arm64 architecture image for lambda 2024-10-09 23:15:10 +08:00
Meng Xin Zhu
c1ee1b4244 chore: add automation script to release images (#58) 2024-10-09 18:20:14 +08:00
yike5460
552578a0ee fix: fix action dep issue 2024-10-09 08:30:19 +00:00
yike5460
d9590d6504 fix: place action file into the right folder 2024-10-09 08:22:14 +00:00
yike5460
29d333d367 feat: enable code review and pr description in workflow 2024-10-09 08:13:02 +00:00
Yuki Sekiya
c655f50616 feat: Handle multiple user messages in a single request (#26) 2024-10-09 15:13:58 +08:00
Aiden Dai
97d77ab0c5 Merge pull request #39 from diopres/patch-2
feat: add support for Mistral Large 2 (24.07)
2024-08-14 14:42:25 +08:00
diopres
db0817392f feat: add support for Mistral Large 2 (24.07)
added support for Mistral Large 2 (24.07)
2024-08-12 19:06:44 +05:30
Aiden Dai
2950c15ecb Fix empty response bug 2024-08-09 17:28:09 +08:00
Aiden Dai
f8faf32a76 Add Llama 3.1 without tool call 2024-07-30 12:27:08 +08:00
Aiden Dai
f6b73152bc Update boto3 version 2024-06-25 17:29:47 +08:00
Aiden Dai
66bdfdf5c1 Support Claude 3.5 Sonnet 2024-06-21 10:28:04 +08:00
Aiden Dai
49dd6608a0 Support of tool choice 2024-06-21 10:24:11 +08:00
Aiden Dai
b3509ee0f0 Support multiple tool calls 2024-06-11 16:58:26 +08:00
Aiden Dai
56786f9e32 Update api response 2024-06-11 10:53:56 +08:00
Aiden Dai
6ef7641a0d Update api response 2024-06-07 10:58:44 +08:00
Aiden Dai
5f84cef13a Refactor to use new Converse API 2024-06-04 17:01:06 +08:00
Aiden Dai
f0ea117732 Refactor to use new Converse API 2024-06-04 16:20:25 +08:00
Aiden Dai
696039053d Refactor to use new Converse API 2024-06-04 14:59:40 +08:00
Aiden Dai
86e3db7e09 Merge pull request #20 from didier-durand/patch-2
Update Usage.md: fixing 1 typo + 1 spelling
2024-06-03 09:41:14 +08:00
Aiden Dai
c3b8f3d9e1 Merge pull request #19 from didier-durand/patch-1
Fixing 1 typo in README.md: spelling n section header
2024-06-03 09:41:00 +08:00
Didier Durand
aef0a56016 Update Usage.md: fixing 1 typo + 1 spelling
Update Usage.md: fixing 1 typo + 1 spelling
2024-05-31 18:23:20 +02:00