Posted to dev@tika.apache.org by "Hudson (Jira)" <ji...@apache.org> on 2021/05/11 15:48:00 UTC

[jira] [Commented] (TIKA-3391) Refactor fetchiterators to pipesiterators in 2.x, clean up pipesiteratormanager

    [ https://issues.apache.org/jira/browse/TIKA-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342679#comment-17342679 ] 

Hudson commented on TIKA-3391:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #226 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/226/])
TIKA-3391 refactor fetchiterators to pipes iterators (tallison: [https://github.com/apache/tika/commit/0fa9adc6e88eb56dc9401b512790e685c660106a])
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-csv/src/main/java/org/apache/tika/pipes/fetchiterator/csv/CSVFetchIterator.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-jdbc/src/test/java/org/apache/tika/pipes/pipesiterator/jdbc/TestJDBCPipesIterator.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/pipesiterator/PipesIterator.java
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-jdbc/pom.xml
* (edit) tika-pipes/tika-pipes-integration-tests/src/test/resources/tika-config-s3ToFs.xml
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-s3/src/test/java/org/apache/tika/pipes/pipesiterator/s3/TestS3PipesIterator.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-csv/src/test/resources/test-simple.csv
* (edit) tika-server/tika-server-client/src/main/java/org/apache/tika/server/client/TikaClientCLI.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-csv/src/test/java/TestCSVPipesIterator.java
* (delete) tika-pipes/tika-fetch-iterators/pom.xml
* (delete) tika-core/src/test/resources/org/apache/tika/config/fetch-iterator-multiple-config.xml
* (delete) tika-core/src/main/java/org/apache/tika/pipes/fetchiterator/FetchIteratorManager.java
* (delete) tika-core/src/test/java/org/apache/tika/pipes/fetchiterator/FileSystemFetchIteratorTest.java
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-csv/pom.xml
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-s3/src/test/resources/log4j.properties
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-s3/src/main/java/org/apache/tika/pipes/pipesiterator/s3/S3PipesIterator.java
* (edit) tika-core/src/main/java/org/apache/tika/pipes/async/AsyncProcessor.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-s3/src/test/resources/log4j.properties
* (add) tika-core/src/test/java/org/apache/tika/pipes/pipesiterator/FileSystemPipesIteratorTest.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-s3/pom.xml
* (add) tika-pipes/tika-pipes-iterators/pom.xml
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-csv/src/main/java/org/apache/tika/pipes/pipesiterator/csv/CSVPipesIterator.java
* (edit) tika-core/src/test/java/org/apache/tika/config/TikaPipesConfigTest.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-jdbc/pom.xml
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-s3/pom.xml
* (edit) tika-core/src/test/java/org/apache/tika/pipes/async/AsyncProcessorTest.java
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-s3/src/test/java/org/apache/tika/pipes/fetchiterator/s3/TestS3FetchIterator.java
* (edit) tika-pipes/pom.xml
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-csv/pom.xml
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-csv/src/test/java/TestCSVFetchIterator.java
* (add) tika-core/src/test/resources/org/apache/tika/config/pipes-iterator-multiple-config.xml
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-jdbc/src/test/resources/log4j.properties
* (add) tika-core/src/test/resources/org/apache/tika/config/pipes-iterator-config.xml
* (edit) tika-pipes/tika-pipes-integration-tests/src/test/java/org/apache/tika/pipes/PipeIntegrationTests.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-jdbc/src/test/resources/log4j.properties
* (delete) tika-core/src/main/java/org/apache/tika/pipes/fetchiterator/FetchIterator.java
* (delete) tika-core/src/test/resources/org/apache/tika/config/fetch-iterator-config.xml
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-csv/src/test/resources/test-simple.csv
* (add) tika-core/src/main/java/org/apache/tika/pipes/pipesiterator/FileSystemPipesIterator.java
* (add) tika-pipes/tika-pipes-iterators/tika-pipes-iterator-jdbc/src/main/java/org/apache/tika/pipes/pipesiterator/jdbc/JDBCPipesIterator.java
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-jdbc/src/test/java/org/apache/tika/pipes/fetchiterator/jdbc/TestJDBCFetchIterator.java
* (edit) tika-core/src/main/java/org/apache/tika/config/ConfigBase.java
* (delete) tika-core/src/main/java/org/apache/tika/pipes/fetchiterator/FileSystemFetchIterator.java
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-s3/src/main/java/org/apache/tika/pipes/fetchiterator/s3/S3FetchIterator.java
* (delete) tika-pipes/tika-fetch-iterators/tika-fetch-iterator-jdbc/src/main/java/org/apache/tika/pipes/fetchiterator/jdbc/JDBCFetchIterator.java
* (edit) tika-server/tika-server-client/src/test/resources/tika-config-simple-fs-emitter.xml
* (edit) tika-pipes/tika-pipes-integration-tests/src/test/resources/tika-config-s3Tos3.xml


> Refactor fetchiterators to pipesiterators in 2.x, clean up pipesiteratormanager
> -------------------------------------------------------------------------------
>
>                 Key: TIKA-3391
>                 URL: https://issues.apache.org/jira/browse/TIKA-3391
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Major
>             Fix For: 2.0.0
>
>
> The fetch iterators do iterate through the documents to fetch, but it is more precise to call them pipes iterators because they also inject information about the emit stage. Let's rename FetchIterators to PipesIterators and clean up the PipesIteratorManager (e.g., get rid of it if we can).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)