Update README.md
Browse files
README.md
CHANGED
|
@@ -198,7 +198,7 @@ lmdeploy serve api_server ./workspace \
|
|
| 198 |
|
| 199 |
In the above parameters, `server_name` and `server_port` indicate the service address and port, respectively. The `tp` parameter, as mentioned earlier, stands for Tensor Parallelism. The remaining parameter, instance_num, represents the number of instances and can be understood as the batch size. After execution, it will appear as shown below.
|
| 200 |
|
| 201 |
-
After this, users can start the Web Service as described in [
|
| 202 |
|
| 203 |
## Web Service Startup Method 1:
|
| 204 |
|
|
|
|
| 198 |
|
| 199 |
In the above parameters, `server_name` and `server_port` indicate the service address and port, respectively. The `tp` parameter, as mentioned earlier, stands for Tensor Parallelism. The remaining parameter, instance_num, represents the number of instances and can be understood as the batch size. After execution, it will appear as shown below.
|
| 200 |
|
| 201 |
+
After this, users can start the Web Service as described in [TurboMind Service as the Backend](#--turbomind-service-as-the-backend).
|
| 202 |
|
| 203 |
## Web Service Startup Method 1:
|
| 204 |
|