You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@streampipes.apache.org by "Dominik Riemer (Jira)" <ji...@apache.org> on 2022/11/26 15:31:00 UTC
[jira] [Commented] (STREAMPIPES-582) Redesign the remove duplicates function for adapters
[ https://issues.apache.org/jira/browse/STREAMPIPES-582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17639475#comment-17639475 ]
Dominik Riemer commented on STREAMPIPES-582:
--------------------------------------------
This issue has been migrated to https://github.com/apache/streampipes/issues/761
> Redesign the remove duplicates function for adapters
> ----------------------------------------------------
>
> Key: STREAMPIPES-582
> URL: https://issues.apache.org/jira/browse/STREAMPIPES-582
> Project: StreamPipes
> Issue Type: Improvement
> Components: Connect
> Reporter: Philipp Zehnder
> Priority: Major
> Fix For: 1.0.0
>
>
> h2. Current functionality
> A user can select the *remove duplicate* option in the start adapter tab and select the *time interval* how long events are stored in cache (class: {{{}org.apache.streampipes.connect.adapter.preprocessing.transform.stream.DuplicateFilterPipelineElement{}}})
> h3. Problems
> * The state is not persisted when an adapter is restarted
> * No option to ignore the timestamp field
> h3. New functionality
> * A user should be able to add the remove duplicate rule with the option to ignore the timestamp field.
> * A user should be able to restart an adapter without loosing the state of the remove duplicate rule
> h3. Info
> * To do that we need to extend the API of adapters and add the functionality to execute code when an adapter is stopped
> * An API to load data (on startup) and persist (on shutdown) for elements is required
--
This message was sent by Atlassian Jira
(v8.20.10#820010)