You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Kris Gurusamy <kr...@xpanse.com> on 2020/10/15 20:10:39 UTC

Solr 8.6.3

I've just downloaded solr 8.6.3 and trying to create DIH for loading structured XML. I found out that DIH will be deprecated soon with version 9.0. What is the equivalent of DIH in new solr version? How do I import structured XML data which is very custom and index in Solr new version? Any help is appreciated.

Regards

Kris Gurusamy
Director, Engineering
Kgurusamy@xpanse.com
www.xpanse.com

On 10/15/20, 1:08 PM, "Anshum Gupta (Jira)" <ji...@apache.org> wrote:


         [ https://issues.apache.org/jira/browse/SOLR-14938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Anshum Gupta resolved SOLR-14938.
    ---------------------------------
        Resolution: Invalid

    [~krisgurusamy] - Please ask questions regarding usage on the Solr user mailing list. 

    JIRA is meant for issue tracking purposes.

    > Solr 8.6.3
    > ----------
    >
    >                 Key: SOLR-14938
    >                 URL: https://issues.apache.org/jira/browse/SOLR-14938
    >             Project: Solr
    >          Issue Type: Bug
    >      Security Level: Public(Default Security Level. Issues are Public) 
    >          Components: contrib - DataImportHandler
    >            Reporter: Krishnan
    >            Priority: Major
    >
    > I've just downloaded solr 8.6.3 and trying to create DIH for loading structured XML. I found out that DIH will be deprecated soon with version 9.0. What is the equivalent of DIH in new solr version? How do I import structured XML data which is very custom and index in Solr new version? Any help is appreciated.



    --
    This message was sent by Atlassian Jira
    (v8.3.4#803005)


Re: Solr 8.6.3

Posted by Walter Underwood <wu...@wunderwood.org>.
Solr does not index XML. It has an XML data format for indexing text.

If you want to index and search XML, get MarkLogic. I used to work there.
It is seriously awesome technology.

https://www.marklogic.com <https://www.marklogic.com/>

wunder
Walter Underwood
wunder@wunderwood.org
http://observer.wunderwood.org/  (my blog)

> On Oct 15, 2020, at 1:10 PM, Kris Gurusamy <kr...@xpanse.com> wrote:
> 
> I've just downloaded solr 8.6.3 and trying to create DIH for loading structured XML. I found out that DIH will be deprecated soon with version 9.0. What is the equivalent of DIH in new solr version? How do I import structured XML data which is very custom and index in Solr new version? Any help is appreciated.
> 
> Regards
> 
> Kris Gurusamy
> Director, Engineering
> Kgurusamy@xpanse.com
> www.xpanse.com
> 
> On 10/15/20, 1:08 PM, "Anshum Gupta (Jira)" <ji...@apache.org> wrote:
> 
> 
>         [ https://issues.apache.org/jira/browse/SOLR-14938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> 
>    Anshum Gupta resolved SOLR-14938.
>    ---------------------------------
>        Resolution: Invalid
> 
>    [~krisgurusamy] - Please ask questions regarding usage on the Solr user mailing list. 
> 
>    JIRA is meant for issue tracking purposes.
> 
>> Solr 8.6.3
>> ----------
>> 
>>                Key: SOLR-14938
>>                URL: https://issues.apache.org/jira/browse/SOLR-14938
>>            Project: Solr
>>         Issue Type: Bug
>>     Security Level: Public(Default Security Level. Issues are Public) 
>>         Components: contrib - DataImportHandler
>>           Reporter: Krishnan
>>           Priority: Major
>> 
>> I've just downloaded solr 8.6.3 and trying to create DIH for loading structured XML. I found out that DIH will be deprecated soon with version 9.0. What is the equivalent of DIH in new solr version? How do I import structured XML data which is very custom and index in Solr new version? Any help is appreciated.
> 
> 
> 
>    --
>    This message was sent by Atlassian Jira
>    (v8.3.4#803005)
> 


Re: Solr 8.6.3

Posted by Gus Heck <gu...@gmail.com>.
Shameless self plug,.. JesterJ (which I maintain) has a stax based xml
extractor
https://github.com/nsoft/jesterj/wiki/Document-Processors#staxextractingprocessor
if you want to try it out.

On Thu, Oct 15, 2020 at 4:32 PM Alexandre Rafalovitch <ar...@gmail.com>
wrote:

> Why not do an XSLT transformation on it before it hits Solr.
>
> Or during if it really has to be in-Solr for some reason
>
> https://lucene.apache.org/solr/guide/8_6/uploading-data-with-index-handlers.html#using-xslt-to-transform-xml-index-updates
>
> But you have more options outside as you could use XQuery instead.
>
> As long as final XML is in Solr format, you are good to go.
>
> Regards,
>     Alex
>
> On Thu., Oct. 15, 2020, 4:13 p.m. Kris Gurusamy, <
> krishnan.gurusamy@xpanse.com> wrote:
>
> > I've just downloaded solr 8.6.3 and trying to create DIH for loading
> > structured XML. I found out that DIH will be deprecated soon with version
> > 9.0. What is the equivalent of DIH in new solr version? How do I import
> > structured XML data which is very custom and index in Solr new version?
> Any
> > help is appreciated.
> >
> > Regards
> >
> > Kris Gurusamy
> > Director, Engineering
> > Kgurusamy@xpanse.com
> > www.xpanse.com
> >
> > On 10/15/20, 1:08 PM, "Anshum Gupta (Jira)" <ji...@apache.org> wrote:
> >
> >
> >          [
> >
> https://issues.apache.org/jira/browse/SOLR-14938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> > ]
> >
> >     Anshum Gupta resolved SOLR-14938.
> >     ---------------------------------
> >         Resolution: Invalid
> >
> >     [~krisgurusamy] - Please ask questions regarding usage on the Solr
> > user mailing list.
> >
> >     JIRA is meant for issue tracking purposes.
> >
> >     > Solr 8.6.3
> >     > ----------
> >     >
> >     >                 Key: SOLR-14938
> >     >                 URL:
> > https://issues.apache.org/jira/browse/SOLR-14938
> >     >             Project: Solr
> >     >          Issue Type: Bug
> >     >      Security Level: Public(Default Security Level. Issues are
> > Public)
> >     >          Components: contrib - DataImportHandler
> >     >            Reporter: Krishnan
> >     >            Priority: Major
> >     >
> >     > I've just downloaded solr 8.6.3 and trying to create DIH for
> loading
> > structured XML. I found out that DIH will be deprecated soon with version
> > 9.0. What is the equivalent of DIH in new solr version? How do I import
> > structured XML data which is very custom and index in Solr new version?
> Any
> > help is appreciated.
> >
> >
> >
> >     --
> >     This message was sent by Atlassian Jira
> >     (v8.3.4#803005)
> >
> >
>


-- 
http://www.needhamsoftware.com (work)
http://www.the111shift.com (play)

Re: Solr 8.6.3

Posted by Alexandre Rafalovitch <ar...@gmail.com>.
Why not do an XSLT transformation on it before it hits Solr.

Or during if it really has to be in-Solr for some reason
https://lucene.apache.org/solr/guide/8_6/uploading-data-with-index-handlers.html#using-xslt-to-transform-xml-index-updates

But you have more options outside as you could use XQuery instead.

As long as final XML is in Solr format, you are good to go.

Regards,
    Alex

On Thu., Oct. 15, 2020, 4:13 p.m. Kris Gurusamy, <
krishnan.gurusamy@xpanse.com> wrote:

> I've just downloaded solr 8.6.3 and trying to create DIH for loading
> structured XML. I found out that DIH will be deprecated soon with version
> 9.0. What is the equivalent of DIH in new solr version? How do I import
> structured XML data which is very custom and index in Solr new version? Any
> help is appreciated.
>
> Regards
>
> Kris Gurusamy
> Director, Engineering
> Kgurusamy@xpanse.com
> www.xpanse.com
>
> On 10/15/20, 1:08 PM, "Anshum Gupta (Jira)" <ji...@apache.org> wrote:
>
>
>          [
> https://issues.apache.org/jira/browse/SOLR-14938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> ]
>
>     Anshum Gupta resolved SOLR-14938.
>     ---------------------------------
>         Resolution: Invalid
>
>     [~krisgurusamy] - Please ask questions regarding usage on the Solr
> user mailing list.
>
>     JIRA is meant for issue tracking purposes.
>
>     > Solr 8.6.3
>     > ----------
>     >
>     >                 Key: SOLR-14938
>     >                 URL:
> https://issues.apache.org/jira/browse/SOLR-14938
>     >             Project: Solr
>     >          Issue Type: Bug
>     >      Security Level: Public(Default Security Level. Issues are
> Public)
>     >          Components: contrib - DataImportHandler
>     >            Reporter: Krishnan
>     >            Priority: Major
>     >
>     > I've just downloaded solr 8.6.3 and trying to create DIH for loading
> structured XML. I found out that DIH will be deprecated soon with version
> 9.0. What is the equivalent of DIH in new solr version? How do I import
> structured XML data which is very custom and index in Solr new version? Any
> help is appreciated.
>
>
>
>     --
>     This message was sent by Atlassian Jira
>     (v8.3.4#803005)
>
>

Re: Solr 8.6.3

Posted by David Smiley <ds...@apache.org>.
Kris,

From a user's standpoint, the DIH is not deprecated.  I think we as a
project screwed up the messaging around components in Solr that are
*moving* in terms of code maintenance.  That is not deprecation yet we
referred to it as such, hence your understandable confusion.  I corrected
the warning about this in 8.7, so you won't see that again.

~ David Smiley
Apache Lucene/Solr Search Developer
http://www.linkedin.com/in/davidwsmiley


On Thu, Oct 15, 2020 at 4:13 PM Kris Gurusamy <kr...@xpanse.com>
wrote:

> I've just downloaded solr 8.6.3 and trying to create DIH for loading
> structured XML. I found out that DIH will be deprecated soon with version
> 9.0. What is the equivalent of DIH in new solr version? How do I import
> structured XML data which is very custom and index in Solr new version? Any
> help is appreciated.
>
> Regards
>
> Kris Gurusamy
> Director, Engineering
> Kgurusamy@xpanse.com
> www.xpanse.com
>
> On 10/15/20, 1:08 PM, "Anshum Gupta (Jira)" <ji...@apache.org> wrote:
>
>
>          [
> https://issues.apache.org/jira/browse/SOLR-14938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> ]
>
>     Anshum Gupta resolved SOLR-14938.
>     ---------------------------------
>         Resolution: Invalid
>
>     [~krisgurusamy] - Please ask questions regarding usage on the Solr
> user mailing list.
>
>     JIRA is meant for issue tracking purposes.
>
>     > Solr 8.6.3
>     > ----------
>     >
>     >                 Key: SOLR-14938
>     >                 URL:
> https://issues.apache.org/jira/browse/SOLR-14938
>     >             Project: Solr
>     >          Issue Type: Bug
>     >      Security Level: Public(Default Security Level. Issues are
> Public)
>     >          Components: contrib - DataImportHandler
>     >            Reporter: Krishnan
>     >            Priority: Major
>     >
>     > I've just downloaded solr 8.6.3 and trying to create DIH for loading
> structured XML. I found out that DIH will be deprecated soon with version
> 9.0. What is the equivalent of DIH in new solr version? How do I import
> structured XML data which is very custom and index in Solr new version? Any
> help is appreciated.
>
>
>
>     --
>     This message was sent by Atlassian Jira
>     (v8.3.4#803005)
>
>