You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2012/07/31 14:04:33 UTC

[jira] [Commented] (CONNECTORS-496) Test needed for MySQL that exercises hopcount filtering during a load test

    [ https://issues.apache.org/jira/browse/CONNECTORS-496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425692#comment-13425692 ] 

Karl Wright commented on CONNECTORS-496:
----------------------------------------

According to the client:

>>>>>>
I use MySQL5.5 and CentOS5.8.
I did not make any MySQL setting. I just specified the manifold's database maxhandles to 100.
<<<<<<

Also, no exceptions in the log.

                
> Test needed for MySQL that exercises hopcount filtering during a load test
> --------------------------------------------------------------------------
>
>                 Key: CONNECTORS-496
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-496
>             Project: ManifoldCF
>          Issue Type: Test
>          Components: Framework core
>    Affects Versions: ManifoldCF 0.6
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 0.7
>
>
> User reports that ManifoldCF web crawls run on MySQL fail to find the correct number of documents, compared to web crawls run on PostgreSQL.  The documents included differ from run to run.  We need a test that duplicates the appropriate environment.  >12000 documents, hop-count filtering enabled.
> >   - Max Hop on Links: 15
> >   - Max Hop on Redirects: 10
> >   - Include only hosts matching seeds: Checked
> >   - org.apache.manifoldcf.crawler.threads: 50
> >   - org.apache.manifoldcf.database.maxhandles: 100

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira