huggingchat/chat-ui · [NEW] WebSearch 2.0

Hugging Chat org Sep 13, 2023

•

edited Mar 15, 2024

March 15th update: 🌐Internet Access for Assistants

Hi HuggingChat community!

We've just released a big update to the WebSearch feature, it now uses Retrieval-augmented generation (RAG) to extract relevant information from multiple web pages! From our tests it's much more powerful than before 🚀.

We would love to get your feedback on it! Also, if you want to check the details or even contribute, take a look at the PR on Github.

See you soon!

victor changed discussion title from [NEW] - Updated WebSearch - feedback welcome! to [DRAFT][NEW] - Updated WebSearch - feedback welcome! Sep 13, 2023

victor changed discussion title from [DRAFT][NEW] - Updated WebSearch - feedback welcome! to [NEW] - Updated WebSearch - feedback welcome! Sep 13, 2023

victor pinned discussion Sep 13, 2023

victor changed discussion title from [NEW] - Updated WebSearch - feedback welcome! to [NEW] WebSearch 2.0 - feedback welcome! Sep 13, 2023

BramVanroy

Sep 13, 2023

I think I've asked this elsewhere but I'm not sure what the answer was. Do you use a paid API to query Google search? I'm asking because I can imagine that if it's through something hacky like selenium, Google won't like it (and they'll miss ad revenue). So, in short what does the technical pipeline look like for this from user query to generated output?

nsarrazin

Hugging Chat org Sep 13, 2023

I think I've asked this elsewhere but I'm not sure what the answer was. Do you use a paid API to query Google search? I'm asking because I can imagine that if it's through something hacky like selenium, Google won't like it (and they'll miss ad revenue). So, in short what does the technical pipeline look like for this from user query to generated output?

You can check out the feature here!

MoritzLaurer

Sep 13, 2023

@BramVanroy , they use the SerpAPI which, as far as I understand, is paid and legal. see the source code here: https://github.com/huggingface/chat-ui/blob/main/src/lib/server/websearch/searchWeb.ts

MoritzLaurer

Sep 13, 2023

•

edited Sep 13, 2023

Question regarding sources/citations: Do I understand correctly that you currently display all URLs as sources, which the retriever retrieved and gave the LLM as context (regardless of whether the LLM actually used/refers to the source)? In Bing chat, they somehow managed to attribute sources to specific parts/sentences of the generated output and not only the generated output as a whole. I've always been wondering how they made that work (direct citations / attributing sources to specific sub-parts of a generated output). does anyone know?

mishig

Hugging Chat org Sep 14, 2023

•

edited Sep 14, 2023

Question regarding sources/citations: Do I understand correctly that you currently display all URLs as sources, which the retriever retrieved and gave the LLM as context

Yes

In Bing chat, they somehow managed to attribute sources to specific parts/sentences.

Interesting. I guess one simple way would be: for every generated sentence calculate its similarity against the sources and decide the highest scoring source as the source of that sentence. Not sure if that's what they are doing

DarwinAnim8or

Sep 14, 2023

Any chances for a search API that isn't Google's? IE: DuckDuckGo or Bing :)
Awesome work though! Really cool :D

MoritzLaurer

Sep 14, 2023

In Bing chat, they somehow managed to attribute sources to specific parts/sentences.

Interesting. I guess one simple way would be: for every generated sentence calculate its similarity against the sources and decide the highest scoring source as the source of that sentence. Not sure if that's what they are doing

@mishig , yeah true. My first intuition was that I would be hesitant to trust embeddings from a bi-encoder to be reliable enough for this. But if you took a cross encoder, that should actually work quite well. especially if you set a high enough threshold to avoid false positives (then it could even work with a bi-encoder sentence transformer). could be a nice feature to have direct citations :)

bilalazhar50

Sep 14, 2023

•

edited Sep 14, 2023

@mishig first of all thanks for working on the project and making this feature better for everyone , the interface looks like perplexity AI and it was better then i thought , i have been having this issues where it just searches something but then never shows me the answer it shows max tokens
i just keep seeing this
first i see the links of the resources

but then it does nothing at all

should i create the issue on Github

victor

Hugging Chat org Sep 14, 2023

but then never shows me the answer it shows max tokens

Nice catch, never had this.

victor

Hugging Chat org Sep 14, 2023

Any chances for a search API that isn't Google's? IE: DuckDuckGo or Bing :)

Yes that would be cool if someone wants to PR https://github.com/huggingface/chat-ui

olafgeibig

Sep 15, 2023

I tried Llama 2 70B with web search. I asked it about myself and what is my expertise and that it shall include linkedin in its research. It was a complete fail. I tried it three times and it always failed. It didn't search linkedin and it made things up. It said that I'm a medicine professor although I'm a software architect. For many years I had been a freelance developer, you get countless results if you google my name and I have a detailed public profile on linkedin. One answer had search hits on Xing where I also have profile, but it was a different person. Something is going terribly wrong with the web search.

victor

Hugging Chat org Sep 15, 2023

that it shall include linkedin in its research.

Yes, the content of LinkedIn pages is not parsed by the web search. For the Xing page the rendering of users informations seems to happens client-side so it wont work.

zelosleone

Sep 18, 2023

It hallucinates very hard with anything scholarly, sources are almost non existent (The sources provided do not match with the text bot gives) It should be better with integrating pubmed etc free engines to web search.

mishig

Hugging Chat org Sep 18, 2023

•

edited Sep 18, 2023

It hallucinates very hard with anything scholarly, sources are almost non existent (The sources provided do not match with the text bot gives) It should be better with integrating pubmed etc free engines to web search.

@zelosleone I see. Yes right now we are just performing simple google search and picking the top 10 organic results. But as you have said, it is prone to hallucinating on scholarly topics. As a next step, what we can potentially do is: a routing mechanism (i.e. classifier) that classified user search query to an appropriate search engine (pubmed for biomed, arxiv for ML, etc.)

mishig

Hugging Chat org Sep 18, 2023

•

edited Sep 18, 2023

Any chances for a search API that isn't Google's? IE: DuckDuckGo or Bing :)

As Victor said, @DarwinAnim8or not at the moment. But it shouldn't be too hard to abstract away search engine interface and let the user plugin the choice of engine.
Specifically, you can start looking at this part of the codebase: https://github.com/huggingface/chat-ui/blob/main/src/lib/server/websearch/searchWeb.ts

mishig

Hugging Chat org Sep 18, 2023

@bilalazhar50 I see that you are getting the max new tokens error. Did it happen on the first try or was it a continuation of a long existing chat (in which case, you hit the limit of max new tokens) ?

mishig

Hugging Chat org Sep 18, 2023

I tried Llama 2 70B with web search. I asked it about myself and what is my expertise and that it shall include linkedin in its research. It was a complete fail

@krassmann atm, linkedin has a mechanism against scraping, which is why your linkedin was not included in the context. However, we will keep iteratively improving the feature 🤗

engperini

Sep 19, 2023

Any chances for a search API that isn't Google's? IE: DuckDuckGo or Bing :)
Awesome work though! Really cool :D

I worked on a simple and similar chatbot and I found DuckDuckGo doesn't provide links as Google Search API does. I managed to extract the "title" of the 5 organic results, extract the "snippet text" also concatenate it and send to LLM (using Openai functions). I would like to improve it.

Leyline-ecom

Sep 20, 2023

Will you be adding a feature to upload files like pdfs or txt files?

mishig

Hugging Chat org Sep 20, 2023

Will you be adding a feature to upload files like pdfs or txt files?

yes

blanchon

Sep 20, 2023

•

edited Sep 20, 2023

Will you be adding a feature to upload files like pdfs or txt files?

I have an open issue and PR for PDF support, but more specially PDF support with an OCR integration to handle math equation and table. This leverage the Mathpix private API so this feature is not for everyone and will likely not be include in the public version of chat-ui. But at least you can try to copy my code and deploy an instance of chat-ui with this. Maybe this will be include as an optional feature in the future.

Mathpix is currently the only viable OCR solution I know, I don't know any open source solution that could produce such result at the time.

However 90% of the user don't need advanced OCR features and a simple text extraction is enouth, Claude and Perplexity PDF feature work this way and that already quite impressive.

@mishig Did you have more info about the current work on this feature ? I don't see anything on GH expect my issue and PR

blanchon

Sep 20, 2023

Any chances for a search API that isn't Google's? IE: DuckDuckGo or Bing :)

Yes that would be cool if someone wants to PR https://github.com/huggingface/chat-ui

Look like a good idea, if I find the time soon I might try to do this

mishig

Hugging Chat org Sep 21, 2023

@blanchon have you checked https://facebookresearch.github.io/nougat/ yet ?

flozi00

Sep 22, 2023

•

edited Sep 22, 2023

Any chances for a search API that isn't Google's? IE: DuckDuckGo or Bing :)

Yes that would be cool if someone wants to PR https://github.com/huggingface/chat-ui

Look like a good idea, if I find the time soon I might try to do this

https://github.com/huggingface/chat-ui/issues/338#issuecomment-1631135948

Maybe this can help

Smorty100

Sep 23, 2023

•

edited Sep 23, 2023

The LLM seems to forget any previous conversation information. I asked it first about some code for Godot (game engine) without web search. Then afterwards, I asked it to convert this code into the new Godot 4 GDScript and I gave it internet access for that. It asked me, what code I was talking about. Then I prompted it to summarize the conversation, and it only rembers how it was supposed to summarize some non existent code.

Leyline-ecom

Sep 25, 2023

The LLM seems to forget any previous conversation information. I asked it first about some code for Godot (game engine) without web search. Then afterwards, I asked it to convert this code into the new Godot 4 GDScript and I gave it internet access for that. It asked me, what code I was talking about. Then I prompted it to summarize the conversation, and it only rembers how it was supposed to summarize some non existent code.

This is kinda why I asked about the ability to read pdf/txt files. It seems not so good with memory retention.

Kingsloy

Sep 25, 2023

Good!

Anonymous-baba

Sep 26, 2023

This comment has been hidden

yahma

Sep 28, 2023

Would be nice if the websearch could be used ONLY when needed. If I enable the web-search, it performs a web search on EVERY SINGLE QUERY. If I say "Hi, how are you", why does it need to perform a websearch?

nsarrazin

Hugging Chat org Sep 29, 2023

@yahma We've been working on this, it'll come with the agents feature!

SvCy

Sep 29, 2023

•

edited Sep 29, 2023

Websearch issues
searches on every single msg;
indifferent search: doesn't comply with the context of the chat, gets too robotic (ik, ik);
hallucinates;
connection error lots of times

IMG 01

[Dinosaur](https://cdn-uploads.huggingface.co/production/uploads/64b975b696676e40d0ea08aa/gA0A7SmSjHrrSSj6B7A7H.png)

IMG 02

https://cdn-uploads.huggingface.co/production/uploads/64b975b696676e40d0ea08aa/aCZX4HqPJlVTIqHjC_4aa.png

bye openassistant

mishig

Hugging Chat org Oct 2, 2023

•

edited Oct 2, 2023

searches on every single msg;

Would be nice if the websearch could be used ONLY when needed. If I enable the web-search, it performs a web search on EVERY SINGLE QUERY. If I say "Hi, how are you", why does it need to perform a websearch?

At the moment. as long as the Websearch radio is checked on, it will perform websearch on every single message. However, I agree that on the next updates, we should make the LLM decide whether to use websearch or not depending on the user prompt.

indifferent search: doesn't comply with the context of the chat, gets too robotic (ik, ik);
hallucinates;
connection error lots of times

We will work on the prompting as well so that the LLM generations make more sense and follow the context of the chat better

SvCy

Oct 2, 2023

alright! thank you very much!! :)

TomKos413

Oct 5, 2023

Hi, what is the hardware used to run the LLama model that is used in current version, to get this level of response time that can be seen on the current version ?

emg110

Oct 15, 2023

The immediate effect is awesome but I found an issue and that's when the model with websearch enabled keeps the context of previous questions and try to respond to them each time asked a new question and answers them first over and over , again and again! All answers are awesome but the repeating acculative response is an issue!

aichatter

Oct 16, 2023

Hi! Is the app down today? Thanks