You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by "Molala, Nagaraj (GE Capital, Non-GE)" <Na...@ge.com> on 2012/08/31 08:58:28 UTC

JIRA Issue : SOLR-3771

GE INTERNAL

Hi,

Is there any option in the configuration to add proxy setting in the
rss-data-config.xml file or any configuration in solr?. Basically
mentioned rss url is able to parse with standalone java program using
proxy setting. while trying to index with the Solr using dataImport, its
giving the UnknownHostException exception. I am attaching the config
files.

<?xml version="1.0" encoding="UTF-8" ?>
<dataConfig>
        <dataSource type="URLDataSource" />
        <document>
                <entity name="IndustryMonitor"
                pk="link"
                url="company URL comes here ......"
                processor="XPathEntityProcessor"
                forEach="/channel | /channel/item"
                transformer="DateFormatTransformer">

                <field column="title" xpath="/rss/channel/item/title" />
                <field column="author" xpath="/rss/channel/item/author"
/>              
                <field column="link" xpath="/rss/channel/item/link" />

</entity>
</document>
</dataConfig>

 

Warm Regards,
Nagaraj Molala
Cell: +91 8447016501
________________________________________
From: Jack Krupansky (JIRA) [jira@apache.org]
Sent: Friday, August 31, 2012 4:28 AM
To: Nagaraj M
Subject: [jira] [Commented] (SOLR-3771) While using RSS indexing from
Solr, we are getting error "Caused by: java.net.UnknownHostException" &
indexing fail.

    [
https://issues.apache.org/jira/browse/SOLR-3771?page=com.atlassian.jira.
plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13445389#
comment-13445389 ]

Jack Krupansky commented on SOLR-3771:
--------------------------------------

Any support questions should be directed to the Solr user mailing list:
solr-user@lucene.apache.org

It still doesn't sound as if there is any evidence being presented here
of a "bug" in Solr.


> While using RSS indexing from Solr, we are getting error "Caused by:
java.net.UnknownHostException" & indexing fail.
>
------------------------------------------------------------------------
--------------------------------------------
>
>                 Key: SOLR-3771
>                 URL: https://issues.apache.org/jira/browse/SOLR-3771
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 3.6
>         Environment: Solr Search engine using RSS data feeding
>            Reporter: Nagaraj Molala
>         Attachments: rss-data-config.xml, schema.xml, solrconfig.xml
>
>
> we are getting below error. Please give us the solution as this is a
show stopper for our application. Attached the config files for your
reference.
>
> https://issues.apache.org/jira/browse/SOLR 2:51 PM
> Caused by: java.net.UnknownHostException: xx.abcd.abcd.com
>        at java.net.PlainSocketImpl.connect(Unknown Source)
>        at java.net.SocksSocketImpl.connect(Unknown Source)
>        at java.net.Socket.connect(Unknown Source)
>        at sun.net.NetworkClient.doConnect(Unknown Source)
>        at sun.net.www.http.HttpClient.openServer(Unknown Source)
>        at sun.net.www.http.HttpClient.openServer(Unknown Source)
>        at sun.net.www.http.HttpClient.<init>(Unknown Source)
>        at sun.net.www.http.HttpClient.New(Unknown Source)
>        at sun.net.www.http.HttpClient.New(Unknown Source)
>        at
sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(Unknown
> Source)
>        at
sun.net.www.protocol.http.HttpURLConnection.plainConnect(Unknown Sour
> ce)
>        at sun.net.www.protocol.http.HttpURLConnection.connect(Unknown
Source)
>        at
sun.net.www.protocol.http.HttpURLConnection.getInputStream(Unknown So
> urce)
>        at
org.apache.solr.handler.dataimport.URLDataSource.getData(URLDataSourc
> e.java:97)
>        ... 13 more
>

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA
administrators
For more information on JIRA, see:
http://www.atlassian.com/software/jira

 

Warm Regards,

Nagaraj Molala

Contractor

GE Capital - Americas

 

T :  +91 120 4675233
M : +91 8447016501.

E : nagaraj.molala@ge.com

 

D-195 Sector 63
Noida, UP 201306, INDIA

 


Re: JIRA Issue : SOLR-3771

Posted by Sami Siren <ss...@gmail.com>.
Based on the stack trace it seems that DIH uses URLConnection. You
might want to try setting the proxy related system properties for the
jvm that runs Solr:

http://docs.oracle.com/javase/6/docs/technotes/guides/net/proxies.html

--
 Sami Siren


On Fri, Aug 31, 2012 at 9:58 AM, Molala, Nagaraj (GE Capital, Non-GE)
<Na...@ge.com> wrote:
> GE INTERNAL
>
> Hi,
>
> Is there any option in the configuration to add proxy setting in the
> rss-data-config.xml file or any configuration in solr?. Basically mentioned
> rss url is able to parse with standalone java program using proxy setting.
> while trying to index with the Solr using dataImport, its giving the
> UnknownHostException exception. I am attaching the config files.
>
> <?xml version="1.0" encoding="UTF-8" ?>
> <dataConfig>
>         <dataSource type="URLDataSource" />
>         <document>
>                 <entity name="IndustryMonitor"
>                 pk="link"
>                 url="company URL comes here ......"
>                 processor="XPathEntityProcessor"
>                 forEach="/channel | /channel/item"
>                 transformer="DateFormatTransformer">
>
>                 <field column="title" xpath="/rss/channel/item/title" />
>                 <field column="author" xpath="/rss/channel/item/author" />
>                 <field column="link" xpath="/rss/channel/item/link" />
>
> </entity>
> </document>
> </dataConfig>
>
>
>
> Warm Regards,
> Nagaraj Molala
> Cell: +91 8447016501
> ________________________________________
> From: Jack Krupansky (JIRA) [jira@apache.org]
> Sent: Friday, August 31, 2012 4:28 AM
> To: Nagaraj M
> Subject: [jira] [Commented] (SOLR-3771) While using RSS indexing from Solr,
> we are getting error "Caused by: java.net.UnknownHostException" & indexing
> fail.
>
>     [
> https://issues.apache.org/jira/browse/SOLR-3771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13445389#comment-13445389
> ]
>
> Jack Krupansky commented on SOLR-3771:
> --------------------------------------
>
> Any support questions should be directed to the Solr user mailing list:
> solr-user@lucene.apache.org
>
> It still doesn't sound as if there is any evidence being presented here of a
> "bug" in Solr.
>
>
>> While using RSS indexing from Solr, we are getting error "Caused by:
>> java.net.UnknownHostException" & indexing fail.
>>
>> --------------------------------------------------------------------------------------------------------------------
>>
>>                 Key: SOLR-3771
>>                 URL: https://issues.apache.org/jira/browse/SOLR-3771
>>             Project: Solr
>>          Issue Type: Bug
>>          Components: search
>>    Affects Versions: 3.6
>>         Environment: Solr Search engine using RSS data feeding
>>            Reporter: Nagaraj Molala
>>         Attachments: rss-data-config.xml, schema.xml, solrconfig.xml
>>
>>
>> we are getting below error. Please give us the solution as this is a show
>> stopper for our application. Attached the config files for your reference.
>>
>> https://issues.apache.org/jira/browse/SOLR 2:51 PM
>> Caused by: java.net.UnknownHostException: xx.abcd.abcd.com
>>        at java.net.PlainSocketImpl.connect(Unknown Source)
>>        at java.net.SocksSocketImpl.connect(Unknown Source)
>>        at java.net.Socket.connect(Unknown Source)
>>        at sun.net.NetworkClient.doConnect(Unknown Source)
>>        at sun.net.www.http.HttpClient.openServer(Unknown Source)
>>        at sun.net.www.http.HttpClient.openServer(Unknown Source)
>>        at sun.net.www.http.HttpClient.<init>(Unknown Source)
>>        at sun.net.www.http.HttpClient.New(Unknown Source)
>>        at sun.net.www.http.HttpClient.New(Unknown Source)
>>        at
>> sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(Unknown
>> Source)
>>        at sun.net.www.protocol.http.HttpURLConnection.plainConnect(Unknown
>> Sour
>> ce)
>>        at sun.net.www.protocol.http.HttpURLConnection.connect(Unknown
>> Source)
>>        at
>> sun.net.www.protocol.http.HttpURLConnection.getInputStream(Unknown So
>> urce)
>>        at
>> org.apache.solr.handler.dataimport.URLDataSource.getData(URLDataSourc
>> e.java:97)
>>        ... 13 more
>>
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA
> administrators
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>
>
>
> Warm Regards,
>
> Nagaraj Molala
>
> Contractor
>
> GE Capital – Americas
>
>
>
> T :  +91 120 4675233
> M : +91 8447016501.
>
> E : nagaraj.molala@ge.com
>
>
>
> D-195 Sector 63
> Noida, UP 201306, INDIA
>
>