RequestHunt is a community-driven platform that collects and curates feature requests from X (Twitter), Reddit, and GitHub. We help product teams discover what features real users are asking for, making it easier to validate product ideas and prioritize feature development based on actual user demand.

How does RequestHunt collect feature requests?

RequestHunt uses AI-powered extraction to automatically collect feature requests from social media platforms including X, Reddit, and GitHub. The platform monitors discussions, issues, and posts to identify genuine user feature requests, extracting key information like the requested feature, product mentioned, and user sentiment.

Who uses RequestHunt?

RequestHunt is used by product managers, founders, developers, and product teams who want to discover what features real users are requesting. It's particularly valuable for startups looking to validate ideas, established companies prioritizing their roadmap, and developers building products that solve real user problems.

Is RequestHunt free to use?

Yes, browsing and searching feature requests on RequestHunt is completely free. We also offer API access for developers who want to integrate feature request data into their own applications. Check our pricing page for API plan details.

How often is RequestHunt data updated?

RequestHunt continuously collects feature requests from Reddit, X (Twitter), and GitHub. You can also trigger on-demand scraping via the API or web interface to get the freshest data for any topic.

RequestHunt

Loading request...

[Ragflow] Allow updating max-tokens for local deployment LLM | RequestHunt

[Ragflow] Allow updating max-tokens for local deployment LLM

| | , , , ,

by @@fengyuhan2019 | 11/8/2024 | submitted by AutoCollector

[view original source]

The user reports that when deploying local LLMs via Xinference in Ragflow, there's no option to set max-tokens on the model page, leading to conversation length issues. They request the ability to update this parameter.

Original Source

@@fengyuhan2019 | 0 pts | 11/8/2024

### Is there an existing issue for the same bug? - [X] I have checked the existing issues. ### Branch name v0.13 ### Commit ID token123 ### Other environment information _No response_ ### Actual behavior 1、使用xinference部署本地大模型，例如qwen系列。 2、登录ragflow,切换到用户管理->模型页面，添加xinference模型，该页面没有max-tokens参数。 3、在全局模型配置中将新增模型设置为全局模型。 4、对话中新建对话，选择新增的模型，提问题，经常回答由于长度限制……继续吗？输入继续无效。 ### Expected behavior _No response_ ### Steps to reproduce ```Markdown 1、使用xinference部署本地大模型，例如qwen系列。 2、登录ragflow,切换到用户管理->模型页面，添加xinference模型，该页面没有max-tokens参数。 3、在全局模型配置中将新增模型设置为全局模型。 4、对话中新建对话，选择新增的模型，此处有max-tokens，对商业API-KEY方式接入大模型有效，对本地部署模型无效。 5、查看模型列表API数据，发展本地部署模型，没有max-tokens参数。 6、提问题，经常回答由于长度限制……继续吗？输入继续无效。 7、根本原因在于添加的本地部署大模型，即用户模型缺少max-tokens字段，模型配置页面和数据库中没有相关字段，导致问答时，调用的大模型实例使用了默认的8192（以前是512）所致。 ``` ### Additional information _No response_

Discussion

Loading comments...