feature: add hunyuan llm support for plugins/ai-proxy #1018

xychen5 · 2024-05-30T09:27:15Z

给出了本地测试千问的配置文件，发现无法测通，请求url是：curl --location 'http://127.0.0.1:10000/v1/chat/completions'
--header 'Content-Type: application/json'
--data '{
"model": "gpt-4-turbo",
"messages": [
{
"role": "user",
"content": "你是谁？"
}
],
"temperature": 0.3
}'

CLAassistant · 2024-05-30T09:27:21Z

All committers have signed the CLA.

CH3CHO · 2024-05-30T09:29:11Z

无法测通是什么意思？报什么错，gateway 容器输出什么

xychen5 · 2024-05-30T09:32:57Z

无法测通是什么意思？报什么错，gateway 容器输出什么
本地测试就打了这一行日志，然后postman等个几分钟，就会返回stream timeout

CH3CHO · 2024-05-30T09:34:56Z

无法测通是什么意思？报什么错，gateway 容器输出什么
本地测试就打了这一行日志，然后postman等个几分钟，就会返回stream timeout

你是用的最新的代码吗？参考一下这个 PR 里的文档：#1005

xychen5 · 2024-05-30T09:38:57Z

无法测通是什么意思？报什么错，gateway 容器输出什么
本地测试就打了这一行日志，然后postman等个几分钟，就会返回stream timeout

你是用的最新的代码吗？参考一下这个 PR 里的文档：#1005

嗯，代码是最新的，这个感谢今天上午的提醒，我先参考这个编译方案试一试

xychen5 · 2024-06-05T07:33:06Z

Ⅰ. Describe what this PR did
feature: add hunyuan llm support for plugins/ai-proxy
Ⅱ. Does this pull request fix one issue?
issue fixed: #944
Ⅲ. Why don't you add test cases (unit test/integration test)?
Ⅳ. Describe how to verify it
参考：https://higress.io/zh-cn/docs/user/wasm-go
需要注意的是，higress/plugins/wasm-go/extensions/ai-proxy/envoy.yaml中的clusters字段，记得改成你需要地址，比如混元的话：就会有如下的一个cluster的配置：

# File generated by hgctl. Modify as required.

admin:
  address:
    socket_address:
      protocol: TCP
      address: 0.0.0.0
      port_value: 9901
static_resources:
  listeners:
    - name: listener_0
      address:
        socket_address:
          protocol: TCP
          address: 0.0.0.0
          port_value: 10000
      filter_chains:
        - filters:
            - name: envoy.filters.network.http_connection_manager
              typed_config:
                "@type": type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager
                scheme_header_transformation:
                  scheme_to_overwrite: https
                stat_prefix: ingress_http
                # Output envoy logs to stdout
                access_log:
                  - name: envoy.access_loggers.stdout
                    typed_config:
                      "@type": type.googleapis.com/envoy.extensions.access_loggers.stream.v3.StdoutAccessLog
                # Modify as required
                route_config:
                  name: local_route
                  virtual_hosts:
                    - name: local_service
                      domains: [ "*" ]
                      routes:
                        - match:
                            prefix: "/"
                          route:
                            cluster: moonshot
                            timeout: 300s
                http_filters:
                  - name: wasmtest
                    typed_config:
                      "@type": type.googleapis.com/udpa.type.v1.TypedStruct
                      type_url: type.googleapis.com/envoy.extensions.filters.http.wasm.v3.Wasm
                      value:
                        config:
                          name: wasmtest
                          vm_config:
                            runtime: envoy.wasm.runtime.v8
                            code:
                              local:
                                filename: /etc/envoy/plugin.wasm
                          configuration:
                            "@type": "type.googleapis.com/google.protobuf.StringValue"
                            value: |
                              {
                               "provider": {
                                 "type": "hunyuan",
                                 "hunyuanAuthKey": "VuR92ugGi04yr0EezLe7lm0FiKzrw27N",
                                 "apiTokens": [
                                    "sk-YGeSIaMRA2oSaDa86NCBVPGKdaiSuQ0YSOGI3nEkfvSb4HdT"
                                  ],
                                 "hunyuanAuthId": "AKID2669UvMvTMJF86HbuMnB1rmdZTEvY2KQ",
                                 "timeout": 1200000,
                                 "modelMapping": {
                                   "*": "hunyuan-lite"
                                 }
                               }
                              }
                  - name: envoy.filters.http.router
  clusters:
    - name: httpbin
      connect_timeout: 30s
      type: LOGICAL_DNS
      # Comment out the following line to test on v6 networks
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: httpbin
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: httpbin
                      port_value: 80
    - name: moonshot
      connect_timeout: 30s
      type: LOGICAL_DNS
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: moonshot
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: hunyuan.tencentcloudapi.com
                      port_value: 443
      transport_socket:
        name: envoy.transport_sockets.tls
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.transport_sockets.tls.v3.UpstreamTlsContext
          "sni": "hunyuan.tencentcloudapi.com"

使用如下docker-compose启动该插件：

version: '3.7'
services:
  envoy:
    image: higress-registry.cn-hangzhou.cr.aliyuncs.com/higress/gateway:1.4.0
    entrypoint: /usr/local/bin/envoy
    # 注意这里对wasm开启了debug级别日志，正式部署时则默认info级别
    command: -c /etc/envoy/envoy.yaml --component-log-level wasm:debug
    depends_on:
    - httpbin
    networks:
    - wasmtest
    ports:
    - "10000:10000"
    volumes:
    - ./envoy.yaml:/etc/envoy/envoy.yaml
    - ./out/plugin.wasm:/etc/envoy/plugin.wasm

  httpbin:
    image: kennethreitz/httpbin:latest
    networks:
    - wasmtest
    ports:
    - "12345:80"

networks:
  wasmtest: {}

请求样例如下：

curl --location 'http://127.0.0.1:10000/v1/chat/completions' \
--header 'Content-Type:  application/json' \
--data '{
  "model": "gpt-3",
  "messages": [
    {
      "role": "system",
      "content": "你是一个名专业的开发人员！"
    },
    {
      "role": "user",
      "content": "你好，你是谁？"
    }
  ],
  "temperature": 0.3,
  "stream": false
}'

而后你就可以在本地的pod中查看相应的日志输出：

Ⅴ. Special notes for reviews
no

plugins/wasm-go/extensions/ai-proxy/README.md

plugins/wasm-go/extensions/ai-proxy/provider/hunyuan.go

plugins/wasm-go/extensions/ai-proxy/util/authorize.go

plugins/wasm-go/extensions/ai-proxy/provider/provider.go

xychen5 · 2024-06-05T09:34:12Z

hi，我已经按照意见进行了修改，请再review一下吧^v^ @CH3CHO

plugins/wasm-go/extensions/ai-proxy/README.md

plugins/wasm-go/extensions/ai-proxy/provider/hunyuan.go

xychen5 · 2024-06-06T03:27:15Z

hi，我已经按照意见进行了修改，请辛苦再review一下吧^v^ @CH3CHO

CH3CHO · 2024-06-06T09:23:25Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.1 out of 2 committers have signed the CLA.✅ CH3CHO❌ xychen5You have signed the CLA already but the status is still pending? Let us recheck it.

麻烦按照上面的提示签署一下 CLA。 @xychen5

xychen5 · 2024-06-06T09:28:37Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.1 out of 2 committers have signed the CLA.✅ CH3CHO❌ xychen5You have signed the CLA already but the status is still pending? Let us recheck it.

麻烦按照上面的提示签署一下 CLA。 @xychen5

done，感谢review~

xychen5 mentioned this pull request May 30, 2024

AI 代理 Wasm 插件对接腾讯混元大模型 #944

Closed

feature: add hunyuan llm support for plugins/ai-proxy

0a17538

xychen5 force-pushed the feature/aiProxyWasmPluginForTencentLLM branch from 9458869 to 0a17538 Compare June 5, 2024 07:08

xychen5 changed the title ~~TryFix: qwen本地aiproxy测试不通~~ feature: add hunyuan llm support for plugins/ai-proxy Jun 5, 2024

xychen5 marked this pull request as ready for review June 5, 2024 07:33

xychen5 requested review from johnlanni, WeixinX and CH3CHO as code owners June 5, 2024 07:33

CH3CHO reviewed Jun 5, 2024

View reviewed changes

update: cr modification

c5be05e

CH3CHO reviewed Jun 5, 2024

View reviewed changes

plugins/wasm-go/extensions/ai-proxy/README.md Show resolved Hide resolved

plugins/wasm-go/extensions/ai-proxy/provider/hunyuan.go Outdated Show resolved Hide resolved

update: cr modification

3b1898a

CH3CHO approved these changes Jun 6, 2024

View reviewed changes

Merge branch 'main' into feature/aiProxyWasmPluginForTencentLLM

c7983a0

CH3CHO merged commit c6e3db9 into alibaba:main Jun 6, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature: add hunyuan llm support for plugins/ai-proxy #1018

feature: add hunyuan llm support for plugins/ai-proxy #1018

xychen5 commented May 30, 2024

CLAassistant commented May 30, 2024 •

edited

Loading

CH3CHO commented May 30, 2024

xychen5 commented May 30, 2024

CH3CHO commented May 30, 2024

xychen5 commented May 30, 2024

xychen5 commented Jun 5, 2024

xychen5 commented Jun 5, 2024 •

edited

Loading

xychen5 commented Jun 6, 2024

CH3CHO commented Jun 6, 2024

xychen5 commented Jun 6, 2024

feature: add hunyuan llm support for plugins/ai-proxy #1018

feature: add hunyuan llm support for plugins/ai-proxy #1018

Conversation

xychen5 commented May 30, 2024

CLAassistant commented May 30, 2024 • edited Loading

CH3CHO commented May 30, 2024

xychen5 commented May 30, 2024

CH3CHO commented May 30, 2024

xychen5 commented May 30, 2024

xychen5 commented Jun 5, 2024

xychen5 commented Jun 5, 2024 • edited Loading

xychen5 commented Jun 6, 2024

CH3CHO commented Jun 6, 2024

xychen5 commented Jun 6, 2024

CLAassistant commented May 30, 2024 •

edited

Loading

xychen5 commented Jun 5, 2024 •

edited

Loading