translate: unify Doc Extractor name in Japanese pages (#776) (backport to release/1.14.0) by RiskeyL · Pull Request #810 · langgenius/dify-docs

RiskeyL · 2026-06-03T02:56:24Z

Backport

This will backport the following commits from main to release/1.14.0:

- translate: unify Doc Extractor name in Japanese pages (#776)

* translate: unify Doc Extractor name in Japanese pages Several Japanese pages translate the Doc Extractor workflow node under inconsistent names (ドキュメントエクストラクター, ドキュメントエクストラクタ, ドキュメント抽出機, 文書抽出機, ドキュメント抽出ノード, テキスト抽出ツール), while other pages — including the most user-facing ones — already use ドキュメント抽出器, the same form used for the analogous Parameter Extractor node (パラメータ抽出器). Align the inconsistent pages with ドキュメント抽出器. Pages that already use ドキュメント抽出器 are left unchanged. Intentionally left untouched: - "Doc Extractor/text" UI variable references in quick-start.mdx - "ドキュメント抽出ライブラリ" in the ETL_TYPE description - Generic "ドキュメント抽出" listed alongside other categories in create-knowledge-pipeline.mdx - Generic "エクストラクタ" used as a category noun in file-upload.mdx * translate: align Doc Extractor term to テキスト抽出 and drop legacy edits --------- Co-authored-by: RiskeyL <7a8y@163.com> (cherry picked from commit 68a357f)

Copilot

Pull request overview

This backport standardizes the Japanese naming for the Doc Extractor node across several end-user docs, aligning references to テキスト抽出 / テキスト抽出ノード in tutorials, node docs, and knowledge pipeline guidance.

Changes:

Renames Doc Extractor mentions to テキスト抽出 / テキスト抽出ノード across multiple Japanese pages.
Updates related headings, captions/alt text, and in-page link text to match the new wording.
Adjusts a few sentences where the node name is referenced in workflow explanations.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
ja/use-dify/tutorials/article-reader.mdx	Updates the section heading and body references for the Doc Extractor node, but still has some leftover inconsistent terminology in the same page.
ja/use-dify/nodes/user-input.mdx	Updates an example routing sentence to refer to テキスト抽出ノード.
ja/use-dify/nodes/doc-extractor.mdx	Renames the node page title/content to テキスト抽出 and updates related captions.
ja/use-dify/knowledge/knowledge-pipeline/knowledge-pipeline-orchestration.mdx	Updates the document processor section and cross-link text to テキスト抽出, plus Doc Extractor label wording.
ja/use-dify/getting-started/quick-start.mdx	Updates the quick start step text to refer to テキスト抽出ノード.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

 ビジネスシーンに応じて、適切なファイルアップロード方法を選択してください。

-### **テキスト抽出ツール**
+### **テキスト抽出**


 PDF、XLSX、DOCXなど多様な形式のドキュメントが存在しますが、LLMはこれらをそのまま扱えません。そのため、抽出器（Extractor）が各種ファイルを解析・変換し、LLMが扱いやすい形式に変換します。

-Difyのドキュメント抽出器、あるいはMarketplaceから「Dify Extractor」「Unstructured」等のツールを選択できます。
+Difyのテキスト抽出ノード、あるいはMarketplaceから「Dify Extractor」「Unstructured」等のツールを選択できます。



 <Tip>
-詳細は[ドキュメント抽出器](/ja/use-dify/nodes/doc-extractor)をご参照ください。
+詳細は[テキスト抽出](/ja/use-dify/nodes/doc-extractor)をご参照ください。


+1. ドキュメントノードの後に、テキスト抽出ノードを追加します。

-2. ドキュメント抽出器ノードのパネルで、`Document/result`を入力変数として設定します。
+2. テキスト抽出ノードのパネルで、`Document/result`を入力変数として設定します。




-ドキュメントエクストラクターノードは、アップロードされたファイルを大規模言語モデルが処理できるテキストに変換します。言語モデルはPDFやDOCXなどのドキュメント形式を直接読み取ることができないため、このノードはファイルアップロードとAI分析の間の重要な橋渡し役を果たします。
+テキスト抽出ノードは、アップロードされたファイルを大規模言語モデルが処理できるテキストに変換します。言語モデルはPDFやDOCXなどのドキュメント形式を直接読み取ることができないため、このノードはファイルアップロードとAI分析の間の重要な橋渡し役を果たします。


Copilot AI review requested due to automatic review settings June 3, 2026 02:56

RiskeyL added the backport label Jun 3, 2026

RiskeyL mentioned this pull request Jun 3, 2026

translate: unify Doc Extractor name in Japanese pages #776

Merged

dosubot Bot added size:M This PR changes 30-99 lines, ignoring generated files. documentation Improvements or additions to documentation labels Jun 3, 2026

Copilot started reviewing on behalf of RiskeyL June 3, 2026 02:56 View session

RiskeyL merged commit 10132d2 into release/1.14.0 Jun 3, 2026
6 checks passed

RiskeyL deleted the backport/release/1.14.0/pr-776 branch June 3, 2026 02:57

Copilot AI reviewed Jun 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

translate: unify Doc Extractor name in Japanese pages (#776) (backport to release/1.14.0)#810

translate: unify Doc Extractor name in Japanese pages (#776) (backport to release/1.14.0)#810
RiskeyL merged 1 commit into
release/1.14.0from
backport/release/1.14.0/pr-776

RiskeyL commented Jun 3, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants



		ドキュメントエクストラクターノードは、アップロードされたファイルを大規模言語モデルが処理できるテキストに変換します。言語モデルはPDFやDOCXなどのドキュメント形式を直接読み取ることができないため、このノードはファイルアップロードとAI分析の間の重要な橋渡し役を果たします。
		テキスト抽出ノードは、アップロードされたファイルを大規模言語モデルが処理できるテキストに変換します。言語モデルはPDFやDOCXなどのドキュメント形式を直接読み取ることができないため、このノードはファイルアップロードとAI分析の間の重要な橋渡し役を果たします。

Conversation

RiskeyL commented Jun 3, 2026

Backport

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants