You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Bhaskar Devireddy (JIRA)" <ji...@apache.org> on 2012/04/25 17:32:25 UTC
[jira] [Created] (MAHOUT-1001) Performance improvement in
recommenditembased
Bhaskar Devireddy created MAHOUT-1001:
-----------------------------------------
Summary: Performance improvement in recommenditembased
Key: MAHOUT-1001
URL: https://issues.apache.org/jira/browse/MAHOUT-1001
Project: Mahout
Issue Type: Improvement
Components: Collaborative Filtering
Affects Versions: 0.6
Reporter: Bhaskar Devireddy
Assignee: Sean Owen
Fix For: 0.7
While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycle. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-1001) Performance improvement in
recommenditembased
Posted by "Bhaskar Devireddy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Bhaskar Devireddy updated MAHOUT-1001:
--------------------------------------
Description: While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycles. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures. (was: While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycle. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures.)
> Performance improvement in recommenditembased
> ---------------------------------------------
>
> Key: MAHOUT-1001
> URL: https://issues.apache.org/jira/browse/MAHOUT-1001
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Affects Versions: 0.6
> Reporter: Bhaskar Devireddy
> Assignee: Sean Owen
> Fix For: 0.7
>
> Attachments: RowSimilarityJob.patch
>
>
> While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycles. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-1001) Performance improvement in
recommenditembased
Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen updated MAHOUT-1001:
------------------------------
Due Date: 25/Apr/12
Priority: Minor (was: Major)
> Performance improvement in recommenditembased
> ---------------------------------------------
>
> Key: MAHOUT-1001
> URL: https://issues.apache.org/jira/browse/MAHOUT-1001
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Affects Versions: 0.6
> Reporter: Bhaskar Devireddy
> Assignee: Sean Owen
> Priority: Minor
> Fix For: 0.7
>
> Attachments: RowSimilarityJob.patch
>
>
> While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycles. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (MAHOUT-1001) Performance improvement in
recommenditembased
Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen resolved MAHOUT-1001.
-------------------------------
Resolution: Fixed
Looks good, I've committed.
> Performance improvement in recommenditembased
> ---------------------------------------------
>
> Key: MAHOUT-1001
> URL: https://issues.apache.org/jira/browse/MAHOUT-1001
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Affects Versions: 0.6
> Reporter: Bhaskar Devireddy
> Assignee: Sean Owen
> Fix For: 0.7
>
> Attachments: RowSimilarityJob.patch
>
>
> While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycles. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAHOUT-1001) Performance improvement in
recommenditembased
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262016#comment-13262016 ]
Hudson commented on MAHOUT-1001:
--------------------------------
Integrated in Mahout-Quality #1449 (See [https://builds.apache.org/job/Mahout-Quality/1449/])
MAHOUT-1001 optimization of vector allocation (Revision 1330414)
Result = SUCCESS
srowen : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1330414
Files :
* /mahout/trunk/core/src/main/java/org/apache/mahout/math/hadoop/similarity/cooccurrence/RowSimilarityJob.java
> Performance improvement in recommenditembased
> ---------------------------------------------
>
> Key: MAHOUT-1001
> URL: https://issues.apache.org/jira/browse/MAHOUT-1001
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Affects Versions: 0.6
> Reporter: Bhaskar Devireddy
> Assignee: Sean Owen
> Priority: Minor
> Fix For: 0.7
>
> Attachments: RowSimilarityJob.patch
>
>
> While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycles. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-1001) Performance improvement in
recommenditembased
Posted by "Bhaskar Devireddy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Bhaskar Devireddy updated MAHOUT-1001:
--------------------------------------
Attachment: RowSimilarityJob.patch
> Performance improvement in recommenditembased
> ---------------------------------------------
>
> Key: MAHOUT-1001
> URL: https://issues.apache.org/jira/browse/MAHOUT-1001
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Affects Versions: 0.6
> Reporter: Bhaskar Devireddy
> Assignee: Sean Owen
> Fix For: 0.7
>
> Attachments: RowSimilarityJob.patch
>
>
> While running the recommendations with ASFEMail dataset using the example script provided with mahout, we noticed that execution time for unsymmetrify mapper is very long. While profiling the task we noticed a hotspot consuming high CPU cycle. Please find the attached patch addressing issue and optimizes the unsymmetrify mapper class. This patch while retaining functionality(verified the output with and without patch) speeds up the unsymmetrify mapper by more then 5X on x86 architectures.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira