Hello , nice to be here in this awesome community. I am trying to build a service which uses llam 2 7b to extract specific insights and summaries from a conversation transcript between two people. How can I successfully deploy this model and serve is through FastApi.
Anyone with the experience or knowlwdge to guide me. Will truly appreciate this. I specifically want to know how to do this using 1) SageMaker 2) deploying my own instance to handle everything