Query Document

introduction the cloudfiles query document flow action enables to execute natural language prompts on the data of the documents that have already been processed using process document using ai docid 31ujx1ligtkwfkjuanbzt action what this action does the cloudfiles query document action allows you to execute a query (a natural language or english language prompt) on the data of a processed document the action processes the query and returns the result as a text output imagine you are processing kyc files uploaded to contact object records using this action, you can classify the uploaded document as a driving license , passport , or neither extract relevant information, such as the passport number , from the document update the corresponding fields on the contact record classification query prompt "check the type of file if it's a driving license, return 1; if it's a passport, return 2; if neither, return 0 " based on the result (e g , 2 for passport), the flow decides the next steps decision element using the query result, the flow branches into specific paths for each document type follow up queries for example, if the classification query returns 2 (passport) , the flow enters the "is passport" path and runs queries like "extract the passport number " "extract the name on the passport " by using multiple query document actions within a flow, you can build complex workflows that analyze and extract data from documents efficiently input parameters let us discuss each of the input parameters in detail, in the sections below processed document id input the processed document id of the file on which you wish to execute the query the processed document id is a unique identifier provided by cloudfiles that represents the data of a processed document how it is generated ? the id is genrated when a file is processed using the process document using ai docid 31ujx1ligtkwfkjuanbzt flow action after this process, a document processed docid\ xrr pnbpocwwcmyg6qgk9 event is automatically published how to retrieve it ? use the get event details flow action associated with the corresponding event to obtain the id alternatively, if you need to execute queries on a salesforce file later or on demand store the file's processed document id in a custom content version field this allows you to query the document anytime, without relying on the event note the same processed document id can be reused multiple times to reference the document and execute queries as needed query input the query (as a plain text) specifying the prompt you wish to execute on the file identified by the input processed document id you can directly pass a string as the query (e g , "extract the passport holder's name " ) alternatively, you can pass the query dynamically through a variable, formula etc allowing to incorporate complex use cases the effectiveness of the cloudfiles query document action heavily depends on the quality of the query or input prompt tips for crafting effective prompts be specific avoid vague prompts like "return expiry date of the passport" instead, use a more detailed query like "return the expiry date of the passport in the format dd/mm/yyyy " define a return format statement as well which should cover all the expected outcomes in order to avoid garbage values and unprecedented results test and refine you can test the quality of your queries and verify if they yield the desired results by using the playground feature in the cloudfiles app testing queries in the playground navigate to cloudfiles app > document ai > playground upload a sample file use the interactive query side component to experiment with different prompts against the file this iterative process helps you fine tune your queries for optimal results before implementing them in your workflows using ai for general queries not all queries need to be executed on a specific document if you want to run a general ai based query that does not require document data, you can simply omit the processed document id when executing the query this allows cloudfiles ai to process the query independently , focusing only on the text input you provide this can be useful in scenarios where you have pre extracted data from documents and need ai to structure or analyze it further you need ai to generate text based responses , summaries, or formatted outputs based on user defined parameters example use case generating an email from extracted data consider a scenario where you have processed an invoice document and extracted key details such as invoice number inv 20240319 001 invoice amount $2,500 00 primary contact name john doe due date march 25, 2025 now, instead of querying the document again, you want ai to generate a structured email using these extracted details how to configure this in salesforce flow you can create a flow text formula resource to format the extracted details into a structured prompt for ai processing 📌 query example "below are the identified details from an invoice invoice number = {invoice number query result}, invoice amount = {invoice amount query result}, primary contact = {client's primary contact name query result}, due date = {invoice due date query result} now, based on the provided details, compose a structured email to the primary contact the email should \ greet the contact by name \ inform them about the invoice details in a professional and concise manner \ politely request confirmation of the invoice details \ provide a closing statement expressing appreciation and offer support if needed " generated email output (expected result) subject invoice confirmation – inv 20240319 001 dear john doe , i hope you are doing well we would like to confirm the details of your latest invoice invoice number inv 20240319 001 invoice amount $2,500 00 due date march 25, 2025 please review the above details and let us know if everything is correct if there are any discrepancies, kindly reply to this email at your earliest convenience looking forward to your confirmation thank you for your time! best regards, \[your name] \[your company] output parameters the output of the cloudfiles query document action is a text (string) data type containing the result of the query as defined in the query prompt where to find the output ? the action's output is available in the flow resource's action section the output is named based on the corresponding action label for example, if the action is labeled as "full dimension table data" , the output flow resource will be named "text from full dimension table data" see it in action here is the flow in action that we have used in this article as an example in this flow we are providing a document to the ai and then prompting queries to determine whether the given document is kyc or not and classifying the type of kyc if it is either dl or passport

Process Document using AI

Query Document (Batch)