Skip to content

Commit f38d47f

Browse files
lambdabaaclaude
andcommitted
Update Dewey connector with new actions and improved docs
Adds 10 new actions to the connector: - Upload document (simple direct upload, no presigned URL required) - Create collection / Delete collection - List documents / Delete document / Retry document - List sections / Scan sections - Get section / Get section chunks Updates README to document all actions, simplify the Getting Started flow (direct upload is now the primary path), and demote the presigned URL flow to an advanced section. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent 2677bbc commit f38d47f

2 files changed

Lines changed: 785 additions & 52 deletions

File tree

independent-publisher-connectors/Dewey/README.md

Lines changed: 44 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -19,17 +19,47 @@ Fires when document processing fails. Use this to alert your team or retry inges
1919
#### List collections
2020
Returns all collections in your organisation.
2121

22+
#### Create collection
23+
Creates a new collection. Requires a project ID from the Dewey dashboard (Settings → Projects). Use this to provision a collection at the start of a flow before uploading documents.
24+
25+
#### Delete collection
26+
Permanently deletes a collection and all of its documents, sections, and chunks.
27+
2228
#### Get collection
2329
Returns metadata for a single collection by ID.
2430

25-
#### Get document upload URL
26-
Returns a pre-signed S3 URL for a file upload, along with a document ID. After uploading directly to that URL with an HTTP PUT, call **Confirm document upload** to begin processing.
31+
#### List documents
32+
Returns documents in a collection, ordered by newest first. Supports optional filtering by status (`uploading`, `processing`, `ready`, `error`, etc.) and pagination via limit/offset.
33+
34+
#### Delete document
35+
Permanently deletes a document and all of its sections, chunks, and embeddings.
36+
37+
#### Retry document
38+
Re-queues a document that failed processing. Only works on documents with status `error`. Pair with the **When a document has an error** trigger to build automatic retry flows.
39+
40+
#### Upload document
41+
Uploads a file directly to a collection and queues it for processing. This is the simplest way to ingest a document — no hash computation, HTTP PUT, or separate confirmation step required. Pair with **Wait for document** to block until the file is fully indexed.
42+
43+
#### Get document upload URL *(advanced)*
44+
Returns a pre-signed S3 URL for a file upload, along with a document ID. Use this instead of **Upload document** when you need SHA-256 deduplication, want to avoid routing the file payload through Dewey's API (large files), or need to upload from a non-Power Automate client. After uploading directly to that URL with an HTTP PUT, call **Confirm document upload** to begin processing.
2745

2846
#### Confirm document upload
29-
Tells Dewey that the file upload to S3 is complete and processing should begin.
47+
Tells Dewey that the file upload to S3 is complete and processing should begin. Only needed when using the **Get document upload URL** flow.
3048

3149
#### Wait for document
32-
Blocks until the document reaches a terminal state (ready or error) and returns the result. Times out after 5 minutes. Use this after **Confirm document upload** to pause a flow until the document is searchable.
50+
Blocks until the document reaches a terminal state (ready or error) and returns the result. Times out after 5 minutes. Use this after **Upload document** or **Confirm document upload** to pause a flow until the document is searchable.
51+
52+
#### List sections
53+
Returns the heading hierarchy extracted from a document — section IDs, titles, levels, and positions. Use this to navigate document structure or feed section titles into a downstream AI step without loading full content.
54+
55+
#### Scan sections
56+
Searches section titles and summaries across an entire collection using hybrid semantic and keyword matching. Returns a ranked list of the most relevant sections with the document they belong to. Use this to locate the right parts of your corpus before loading content with **Get section** or **Get section chunks**.
57+
58+
#### Get section
59+
Returns a section's metadata and its full Markdown content. Use this after **Scan sections** or **List sections** to read the actual text of a relevant section without loading the entire document.
60+
61+
#### Get section chunks
62+
Returns the individual text chunks that make up a section. Use this instead of **Get section** when you want to feed section content into an AI Builder or Azure OpenAI step as pre-split passages rather than a single Markdown block.
3363

3464
#### Search collection
3565
Runs a hybrid semantic and keyword search against a collection and returns the most relevant text chunks. Ideal for RAG flows where you want to feed context into a subsequent AI Builder or Azure OpenAI step.
@@ -40,6 +70,15 @@ API keys are created in the Dewey dashboard under **Settings → API Keys**. Key
4070
## Getting Started
4171
A typical SharePoint → Dewey ingestion flow:
4272

73+
1. **[SharePoint]** When a file is created in library
74+
2. **[SharePoint]** Get file content
75+
3. **[Dewey]** Upload document *(collectionId, file content, filename)*
76+
4. **[Dewey]** Wait for document *(documentId — blocks until ready or error)*
77+
5. **[Teams / Email]** Notify team — document is now searchable
78+
79+
### Advanced: upload via pre-signed URL
80+
Use this flow when you need explicit deduplication or want to avoid routing large file payloads through Dewey's API:
81+
4382
1. **[SharePoint]** When a file is created in library
4483
2. **[SharePoint]** Get file content
4584
3. **[Dewey]** Get document upload URL *(collectionId, filename, contentType, fileSizeBytes, contentHash)*
@@ -53,4 +92,4 @@ API reference: [meetdewey.com/docs](https://meetdewey.com/docs)
5392
## Known Issues and Limitations
5493
- **Research queries** use Server-Sent Events and cannot be used as a connector action. Use **Search collection** to retrieve relevant chunks and feed them into an AI Builder or Azure OpenAI step instead.
5594
- **Wait for document** holds the flow connection open for up to 5 minutes. For high-volume ingestion flows, set trigger concurrency to limit parallel executions.
56-
- SHA-256 is not natively available in Power Automate expressions. Use `base64(sha256(fileContent))` with `dataUriToString()`, or pass the SharePoint `eTag` as a consistent alternative for deduplication.
95+
- **SHA-256 (advanced flow only)**: SHA-256 is not natively available in Power Automate expressions. If you use **Get document upload URL**, compute the hash with `base64(sha256(fileContent))` and `dataUriToString()`, or pass the SharePoint `eTag` as a consistent proxy. The **Upload document** action handles deduplication internally — no hash required.

0 commit comments

Comments
 (0)