The user requests an OpenAI-compatible endpoint for the vLLM backend in Triton Inference Server, similar to the one that ships with vLLM itself.
**Is your feature request related to a problem? Please describe.**

The vLLM backend works well and is easy to set up, compared to TensorRT, which had me pulling my hair out. However, it lacks the OpenAI-compatible endpoint that ships with vLLM itself. The `/generate` endpoint on its own requires extra work to set up for chat applications (work that I honestly don't know how to do). In essence, just by adopting Triton's vLLM backend instead of vLLM, you have to develop classes and interfaces for all of these things. Not to mention that LangChain has no LLM implementation for it, and LlamaIndex's is a bit primitive, undocumented, and buggy.

**Describe the solution you'd like**

Include vLLM's OpenAI-compatible endpoint as an additional endpoint when using Triton.

**Additional context**

Pros:

- Better integration with LangChain (through `ChatOpenAI`) and LlamaIndex
- Triton becomes orders of magnitude easier to set up, run, and migrate to (i.e., you don't have to rebuild your whole toolset to accommodate Triton)
- Be
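
To illustrate the gap, here is a rough sketch of the client-side difference. With the raw `/generate` endpoint, the client has to apply the model's chat template by hand; with an OpenAI-compatible endpoint, it just sends chat messages. The model name `my_llm`, the sampling parameters, and the simplified Llama-2-style template are illustrative placeholders, not part of any actual Triton configuration.

```python
import json

def build_triton_generate_payload(messages):
    """Payload for Triton's raw generate endpoint
    (POST /v2/models/my_llm/generate) -- the chat template
    must be applied manually, and it varies per model."""
    prompt = ""
    for m in messages:
        if m["role"] == "system":
            # Simplified Llama-2-style system block, for illustration only
            prompt += f"<<SYS>>\n{m['content']}\n<</SYS>>\n\n"
        elif m["role"] == "user":
            prompt += f"[INST] {m['content']} [/INST]"
    return {"text_input": prompt, "parameters": {"stream": False, "temperature": 0.7}}

def build_openai_chat_payload(messages):
    """Payload for an OpenAI-compatible chat endpoint
    (POST /v1/chat/completions) -- messages pass through untouched,
    so ChatOpenAI-style clients work out of the box."""
    return {"model": "my_llm", "messages": messages, "temperature": 0.7}

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]
print(json.dumps(build_triton_generate_payload(messages), indent=2))
print(json.dumps(build_openai_chat_payload(messages), indent=2))
```

The second shape is what LangChain's `ChatOpenAI` and LlamaIndex already speak, which is why exposing it from Triton would remove the need for custom adapter classes.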