This is a blueprint that supports both streaming and non streaming calls to DeepSeek's API。You can customize the streaming output mode, real-time or one sentence。
Support multiple rounds of dialogue.
Support the inference model DeepSeeker R1