Posted to commits@harmony.apache.org by "Vladimir Strigun (JIRA)" <ji...@apache.org> on 2008/03/19 14:34:25 UTC

[jira] Closed: (HARMONY-5599) [drlvm][jit][performance] new movapd instruction for register-to-register copy

     [ https://issues.apache.org/jira/browse/HARMONY-5599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vladimir Strigun closed HARMONY-5599.
-------------------------------------


> [drlvm][jit][performance] new movapd instruction for register-to-register copy
> ------------------------------------------------------------------------------
>
>                 Key: HARMONY-5599
>                 URL: https://issues.apache.org/jira/browse/HARMONY-5599
>             Project: Harmony
>          Issue Type: Improvement
>          Components: DRLVM
>            Reporter: Vladimir Strigun
>            Assignee: Mikhail Fursov
>         Attachments: H-5599.patch, i586.diff, movapd.diff, movapd2.diff
>
>
> Using the movapd instruction to copy between xmm registers is more efficient than a partial movsd copy, so the attached patch replaces movsd with movapd for such operations.
> I checked the patch on the SciMark benchmark [1] and got the following results (about 15% speedup in the composite score):
> orig build:
> SciMark 2.0a
> Composite Score: 236.8043350027899
> FFT (1024): 266.4183025101507
> SOR (100x100):   410.3833460433766
> Monte Carlo : 31.43640457526972
> Sparse matmult (N=1000, nz=5000): 208.14991492655557
> LU (100x100): 267.6337069585971
> with movapd:
> SciMark 2.0a
> Composite Score: 271.62584550328904
> FFT (1024): 296.11079189672955
> SOR (100x100):   458.00820213602486
> Monte Carlo : 31.406979573247035
> Sparse matmult (N=1000, nz=5000): 208.14991492655557
> LU (100x100): 364.453338983888
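
The "about 15%" figure can be checked directly from the two runs quoted above. The following is a minimal sketch, not part of the patch: the dictionary names and the speedup_percent helper are made up for illustration, and the scores are copied verbatim from the report.

```python
# Scores copied from the two SciMark 2.0a runs quoted in the issue.
# The variable and function names here are illustrative only.
orig = {
    "Composite Score": 236.8043350027899,
    "FFT (1024)": 266.4183025101507,
    "SOR (100x100)": 410.3833460433766,
    "Monte Carlo": 31.43640457526972,
    "Sparse matmult (N=1000, nz=5000)": 208.14991492655557,
    "LU (100x100)": 267.6337069585971,
}
movapd = {
    "Composite Score": 271.62584550328904,
    "FFT (1024)": 296.11079189672955,
    "SOR (100x100)": 458.00820213602486,
    "Monte Carlo": 31.406979573247035,
    "Sparse matmult (N=1000, nz=5000)": 208.14991492655557,
    "LU (100x100)": 364.453338983888,
}


def speedup_percent(before, after):
    """Relative change in score, in percent (higher score is better)."""
    return (after / before - 1.0) * 100.0


# Per-kernel and composite change between the two builds.
speedups = {name: speedup_percent(orig[name], movapd[name]) for name in orig}

for name, pct in speedups.items():
    print(f"{name}: {pct:+.1f}%")
```

On these numbers the composite gain comes out just under 15%, with LU showing the largest per-kernel change, while Monte Carlo and sparse matmult are essentially unchanged.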

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.