You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@jackrabbit.apache.org by "Paco Avila (JIRA)" <ji...@apache.org> on 2008/04/04 23:19:25 UTC

[jira] Created: (JCR-1521) Text Extractors are executed twice

Text Extractors are executed twice
----------------------------------

                 Key: JCR-1521
                 URL: https://issues.apache.org/jira/browse/JCR-1521
             Project: Jackrabbit
          Issue Type: Bug
          Components: jackrabbit-text-extractors
    Affects Versions: 1.4
         Environment: JDK 1.5 Ubuntu Gutsy
            Reporter: Paco Avila


I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paco Avila updated JCR-1521:
----------------------------

    Attachment: ExifTextExtractor.java

This implements an (initial) text extractor for EXIF info stored in JPEG files. Need a jar from http://www.drewnoakes.com/code/exif/index.html

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12589107#action_12589107 ] 

Paco Avila commented on JCR-1521:
---------------------------------

Ja, ja... good but it is very annoying force text extraction on checkout. The checkout opperation does no modify any "interesting" node property which justify node content reindexing. 

A, from my point of view, good way to avoid reindex binary can be add a checksum property which can be used to check if the binary content is modified. Or put a isModified property with is set to true when user do ContenNode.setProperty("jcr:data", xxx) .

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587538#action_12587538 ] 

Paco Avila commented on JCR-1521:
---------------------------------

No, this is done through jcr api.

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Dave Brosius (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587462#action_12587462 ] 

Dave Brosius commented on JCR-1521:
-----------------------------------

It might be useful to put this


Exception e = new Exception();
e.fillInStackTrace();
e.printStackTrace();

in your extractText method, to show folks where the two calls are coming from.

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paco Avila updated JCR-1521:
----------------------------

    Attachment: DummyMyTextExtractor.java

Simple dummy text extractor

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paco Avila updated JCR-1521:
----------------------------

    Attachment: DummyTextExtractor.java

TextExtractor is executer after Node.save(), Node.checking() and Node.checkout()

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587888#action_12587888 ] 

monkiki edited comment on JCR-1521 at 4/11/08 1:53 AM:
----------------------------------------------------------

I can confirm that Upload and document creation is executen only one time. And ExifTextExtractor is executed twices because of log info messages.

If the upload if executed twice and ItemAlreadyExistsException should be thrown and it is working fine.

      was (Author: monkiki):
    I can confirm that Upload and document creation is executen only one time. And ExifTextExtractor is executed twices because of log info messages.
  
> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12589049#action_12589049 ] 

Jukka Zitting commented on JCR-1521:
------------------------------------

> TextExtractor is executer after Node.save(), Node.checking() and Node.checkout() 

It's not a bug, it's a feature! :-) All those operations modify the node, causing text extraction to run.

There's been some talk about removing the need to reindex binaries if they haven't changed, but so far there hasn't been any good way to implement that.

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Dave Brosius (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587751#action_12587751 ] 

Dave Brosius commented on JCR-1521:
-----------------------------------

Based on the stack traces, i see this line in both

at es.git.openkm.frontend.server.OKMFileUploadServlet.doPost(OKMFileUploadServlet.java:123)



To me that means that you are receiving a double post from the web side, and thus saving the item twice.

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646585#action_12646585 ] 

Paco Avila commented on JCR-1521:
---------------------------------

I'm interested in correct this behavior, but not sure how can I do it. There is more users interested in this issue and want to meet a good solution? I think that a good way to avoid re-index the data is add a new property which contains a checksum for jcr:data property.

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Improvement
>          Components: indexing, jackrabbit-core
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (JCR-1521) Text Extractors are executed twice

Posted by "Marcel Reutegger (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marcel Reutegger resolved JCR-1521.
-----------------------------------

    Resolution: Invalid

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-1521) Text Extractors are executed twice

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting updated JCR-1521:
-------------------------------

    Component/s:     (was: jackrabbit-text-extractors)
                 jackrabbit-core
                 indexing
     Issue Type: Improvement  (was: Bug)

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Improvement
>          Components: indexing, jackrabbit-core
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Reopened: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paco Avila reopened JCR-1521:
-----------------------------


The text extractor is executed after Node.save() and after Node.checkin()

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paco Avila updated JCR-1521:
----------------------------

    Attachment: DummyTextExtractor.java

Sample code to reproduce issue

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587888#action_12587888 ] 

Paco Avila commented on JCR-1521:
---------------------------------

I can confirm that Upload and document creation is executen only one time. And ExifTextExtractor is executed twices because of log info messages.

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Marcel Reutegger (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587194#action_12587194 ] 

Marcel Reutegger commented on JCR-1521:
---------------------------------------

I'm not able to reproduce this issue.

Can you please be more specific? What methods do you call? Do you have a test case?

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paco Avila updated JCR-1521:
----------------------------

    Attachment: StackTrace.txt

This is the stackrtrace generated.

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587226#action_12587226 ] 

Paco Avila commented on JCR-1521:
---------------------------------

I have uploaded my ExitTextExtractor. This extractor is executed twice each time I put a jpg file into the repository. Used with

* jackrabbit-api-1.4.jar
* jackrabbit-core-1.4.2.jar
* jackrabbit-jcr2spi-1.4.jar
* jackrabbit-jcr-commons-1.4.2.jar
* jackrabbit-spi-1.4.jar
* jackrabbit-spi-commons-1.4.jar
* jackrabbit-text-extractors-1.4.jar


> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (JCR-1521) Text Extractors are executed twice

Posted by "Dave Brosius (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587751#action_12587751 ] 

dbrosius edited comment on JCR-1521 at 4/10/08 6:54 PM:
------------------------------------------------------------

Based on the stack traces, i see this line in both

at es.git.openkm.frontend.server.OKMFileUploadServlet.doPost(OKMFileUploadServlet.java:123)



To me that means that you are receiving a double post from the web side, and thus saving the item twice.


BAH! i confused myself into thinking what is really the milliseconds was a thread id, and saw two different ones in the above line. Nevermind what i said.

      was (Author: dbrosius):
    Based on the stack traces, i see this line in both

at es.git.openkm.frontend.server.OKMFileUploadServlet.doPost(OKMFileUploadServlet.java:123)



To me that means that you are receiving a double post from the web side, and thus saving the item twice.
  
> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-1521) Text Extractors are executed twice

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587524#action_12587524 ] 

Jukka Zitting commented on JCR-1521:
------------------------------------

A wild guess, are you using WebDAV to import files into the repository?

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: ExifTextExtractor.java
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-1521) Text Extractors are executed twice

Posted by "Paco Avila (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paco Avila updated JCR-1521:
----------------------------

    Attachment:     (was: DummyTextExtractor.java)

> Text Extractors are executed twice
> ----------------------------------
>
>                 Key: JCR-1521
>                 URL: https://issues.apache.org/jira/browse/JCR-1521
>             Project: Jackrabbit
>          Issue Type: Bug
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.4
>         Environment: JDK 1.5 Ubuntu Gutsy
>            Reporter: Paco Avila
>         Attachments: DummyMyTextExtractor.java, DummyTextExtractor.java, ExifTextExtractor.java, StackTrace.txt
>
>
> I'have created a test text extractor and the method extractText() is invoked twice. It it really neccesaty or it's a bug?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.