Skip to content

Multimodal RAG with/without images issues #30713

@jlfloresTeika

Description

@jlfloresTeika

Self Checks

  • I have read the Contributing Guide and Language Policy.
  • This is only for bug report, if you would like to ask a question, please head to Discussions.
  • I have searched for existing issues search for existing issues, including closed ones.
  • I confirm that I am using English to submit this report, otherwise it will be closed.
  • 【中文用户 & Non English User】请使用英语提交,否则会被关闭 :)
  • Please do not modify this template :) and fill in all the required fields.

Dify version

latest docker release

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

I did a Multimodal Knowledge with a docx containing images and text, I have created a tool workflow for an Agent to retrieve from the multimodal rag and I find this error consistently, error that in the non-multimodal rag workflows I don't find.

"tool_call_name": "multimodal_rag",
"tool_response": "tool invoke error: read tool response failed: request failed: Attachments variable is not array file or file type."

Image

When I enable the parameter for the images (files array) and feed my chat workflow that is linked with an agent to the multimodal_rag tool workflow the error is:

Object of type File is not JSON serializable

Image

Any help is appreciated, thanks DiFy team and contributors, this feature looks very promising!

For other details you may need to debug, i'm here. ty

✔️ Expected Behavior

The multimodal rag workflow should act like the non-multimodal regarding inputs and ouputs despite image input detail since is optional and it seems to not work as expected.

PD: When I test retrival directly from the retrieval tool it works good. That's why is strange that in an agent with the tool workflow attatched doesn't behave the same.

❌ Actual Behavior

The multimodal rag workflow tool I've configured doesn't behave as expected, I'm experiencing the avobe mentioned issues.

Metadata

Metadata

Assignees

No one assigned

    Labels

    👻 feat:ragEmbedding related issue, like qdrant, weaviate, milvus, vector database.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions