Abid Ali Awan

kingabzpro

AI & ML interests

LLMs, MLOps, ASR, & RL

Organizations

kingabzpro's activity

replied to their post 7 days ago

@victor I think the community is eagerly awaiting the next big month-long event, where everyone can come together to build something, like we used to do in the past.

posted an update 7 days ago
I believe Hugging Face should have something similar to Hacktoberfest. I miss the days when there were events like this every three months for audio, deep reinforcement learning, and Gradio themes, but everything seems to have slowed down. There are no more Hugging Face events.
@victor
  • 3 replies
posted an update 10 days ago
I never imagined that Jenkins could be as powerful and easy to implement as GitHub Actions. Loving it. 🥰
replied to their post 10 days ago

I'm having some issues with the RAG pipeline. It generally takes 0.2-2 seconds for it to respond, and most of the time the embedding model takes even longer. I can implement prompt caching, but I was considering a more hardware-related solution. What do you think about using Ray for distributed serving? Also, what do you think about GraphQL?

posted an update 13 days ago
How can I make my RAG application generate real-time responses? Until now, I have been using Groq for fast LLM generation and the Gradio Live function. I am looking for a better solution that can help me build a real-time application without any delay. @abidlabs

kingabzpro/Real-Time-RAG
  • 2 replies