Posted to user@manifoldcf.apache.org by Swapna Vuppala <sw...@gmail.com> on 2012/02/10 08:17:35 UTC

Unable to index Windows share repositories

Hi,

Until recently, I was using ManifoldCF trunk code (before 0.4 was
released) on Windows XP, and I was able to index files from Windows Share
repositories into Solr successfully. I have now started using ManifoldCF
0.4 on Windows 7.
With the new setup, I am able to index files from a File system repository
with no issue, but I have problems indexing data from a Windows Share
repository.

The job starts and ends with "Result Description : Authorization: Access is
denied." in Simple History. The log file has the message "JCIFS:
Authorization exception reading document/directory
smb://nhance29/TestMails/ - skipping"

Can you please tell me what needs to be done to resolve this?

I tried enabling Debug from properties.xml and this is what I get in the
log file.

DEBUG 2012-02-10 12:34:37,869 (Startup thread) - Connecting to: smb://GLOBAL;stgserver:<password>@nhance29/
DEBUG 2012-02-10 12:34:37,907 (Startup thread) - Seed = 'smb://nhance29/TestMails/'
DEBUG 2012-02-10 12:34:39,781 (Worker thread '1') - JCIFS: getVersions(): documentIdentifiers[0] is: smb://nhance29/TestMails/
DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: In checkInclude for 'smb://nhance29/TestMails/'
DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Matching startpoint 'smb://nhance29/TestMails/' against actual 'smb://nhance29/TestMails/'
DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Startpoint found!
DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Startpoint: always included
DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Leaving checkInclude for 'smb://nhance29/TestMails/'
DEBUG 2012-02-10 12:34:44,421 (Worker thread '1') - JCIFS: Processing 'smb://nhance29/TestMails/'
DEBUG 2012-02-10 12:34:44,421 (Worker thread '1') - JCIFS: 'smb://nhance29/TestMails/' is a directory
 WARN 2012-02-10 12:34:44,425 (Worker thread '1') - JCIFS: Possibly transient exception detected on attempt 1 while listing files: Access is denied.
jcifs.smb.SmbAuthException: Access is denied.
    at jcifs.smb.SmbTransport.checkStatus(SmbTransport.java:546)
    at jcifs.smb.SmbTransport.send(SmbTransport.java:640)
    at jcifs.smb.SmbSession.send(SmbSession.java:238)
    at jcifs.smb.SmbTree.send(SmbTree.java:119)
    at jcifs.smb.SmbFile.send(SmbFile.java:775)
    at jcifs.smb.SmbFile.doFindFirstNext(SmbFile.java:1986)
    at jcifs.smb.SmbFile.doEnum(SmbFile.java:1738)
    at jcifs.smb.SmbFile.listFiles(SmbFile.java:1715)
    at jcifs.smb.SmbFile.listFiles(SmbFile.java:1704)
    at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.fileListFiles(SharedDriveConnector.java:2224)
    at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:701)
    at org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423)
    at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:561)

Thanks and Regards,
Swapna.

Re: Unable to index Windows share repositories

Posted by Swapna Vuppala <sw...@gmail.com>.
Thanks for the information Karl,

Everything is fine now. I'll reach out to Michael Allen if I see the issue
again.

Thanks and Regards,
Swapna.

On Fri, Feb 10, 2012 at 3:41 PM, Karl Wright <da...@gmail.com> wrote:

> Nothing has changed as far as the connectors are concerned.  Is your
> domain controller now upgraded to a different version of windows too?
> If so you may need to play around with the fields that are used for
> authorization, e.g. the form of the username and/or the domain name.
>
> Windows is not an open platform and they change stuff all the time,
> but to the best of my knowledge they have not introduced any new
> authentication modes in Windows 7, so something should work.  If not
> the guy to talk with is Michael Allen, who maintains the jcifs
> library.
>
> Karl

Re: Unable to index Windows share repositories

Posted by Karl Wright <da...@gmail.com>.
Nothing has changed as far as the connectors are concerned.  Is your
domain controller now upgraded to a different version of Windows too?
If so, you may need to play around with the fields that are used for
authorization, e.g. the form of the username and/or the domain name.

Windows is not an open platform and they change stuff all the time,
but to the best of my knowledge they have not introduced any new
authentication modes in Windows 7, so something should work.  If not,
the guy to talk with is Michael Allen, who maintains the jcifs
library.

Karl
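
Editorial note: as a rough illustration of "playing around with the form of
the username and/or the domain name", the sketch below lists a few
domain/username combinations one might try in the connection's credential
fields. It is only a sketch. The class and method names are hypothetical, the
DNS domain "global.example.com" is a made-up placeholder, and only "GLOBAL"
and "stgserver" come from the log above. Which form a given domain controller
accepts is not something this thread establishes.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: enumerates (domain, username) pairs to try by hand.
public class CredentialForms {

    // Each entry is a two-element array: {domain field, username field}.
    public static List<String[]> candidates(String netbiosDomain,
                                            String dnsDomain,
                                            String user) {
        List<String[]> out = new ArrayList<>();
        // Short (NetBIOS-style) domain name plus plain user name.
        out.add(new String[] {netbiosDomain, user});
        // Fully qualified DNS domain name plus plain user name.
        out.add(new String[] {dnsDomain, user});
        // Domain field left empty, user given as user@domain.
        out.add(new String[] {"", user + "@" + dnsDomain});
        // Domain field left empty, plain user name (e.g. a local account).
        out.add(new String[] {"", user});
        return out;
    }

    public static void main(String[] args) {
        // "global.example.com" is an assumed placeholder, not from the thread.
        for (String[] c : candidates("GLOBAL", "global.example.com", "stgserver")) {
            System.out.println("domain='" + c[0] + "' username='" + c[1] + "'");
        }
    }
}
```

Each printed pair would be entered in the connection configuration and the job
re-run, one combination at a time.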


Re: Unable to index Windows share repositories

Posted by Karl Wright <da...@gmail.com>.
Good to hear.

The connector, by the way, is resigned to the fact that sometimes
various things fail when talking to Windows, which is why you see the
"transient failure" notification; it will retry on its own eventually
without killing the job, and only give up when things don't work for
an extended period of time.

Karl
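
Editorial note: the retry behaviour described above can be sketched as a
simple bounded retry loop. This is not the connector's actual code. The class,
the TransientException type, and the attempt and delay parameters are all
assumptions made for illustration. The real connector's retry schedule is not
shown in this thread.

```java
import java.util.concurrent.Callable;

// Hypothetical sketch of "retry on possibly transient failure, give up eventually".
public class TransientRetry {

    // Assumed marker type for failures that are worth retrying.
    public static class TransientException extends Exception {
        public TransientException(String message) {
            super(message);
        }
    }

    // Runs op up to maxAttempts times, sleeping delayMillis between attempts.
    // Rethrows the last transient failure once all attempts are used up.
    public static <T> T callWithRetry(Callable<T> op, int maxAttempts, long delayMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        TransientException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return op.call();
            } catch (TransientException e) {
                last = e;
                System.err.println("Possibly transient exception on attempt "
                        + attempt + ": " + e.getMessage());
                if (attempt < maxAttempts) {
                    Thread.sleep(delayMillis);
                }
            }
        }
        throw last;
    }

    public static void main(String[] args) throws Exception {
        final int[] calls = {0};
        // Simulated listing that is denied twice and then succeeds.
        String result = callWithRetry(() -> {
            if (++calls[0] < 3) {
                throw new TransientException("Access is denied.");
            }
            return "listing succeeded";
        }, 5, 10L);
        System.out.println(result + " after " + calls[0] + " calls");
    }
}
```

Non-transient exceptions are deliberately not caught here, so they propagate
to the caller on the first attempt.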


On Fri, Feb 10, 2012 at 5:08 AM, Swapna Vuppala
<sw...@gmail.com> wrote:
> Hi,
>
> Not sure why, but now am able to index data from Windows Share repositories
> into Solr. I don't get the Access denied messages any more, although I
> haven't changed anything.
>
> Sorry for the inconvenience caused. Will get back again if I see any issue.
>
> Thanks and Regards,
> Swapna.

Re: Unable to index Windows share repositories

Posted by Swapna Vuppala <sw...@gmail.com>.
Hi,

I am not sure why, but I am now able to index data from Windows Share
repositories into Solr. I don't get the "Access is denied" messages any more,
although I haven't changed anything.

Sorry for the inconvenience caused. I will get back to you if I see any issue.

Thanks and Regards,
Swapna.
