You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@trafficserver.apache.org by "Zhao Yongming (Created) (JIRA)" <ji...@apache.org> on 2012/02/17 03:37:59 UTC

[jira] [Created] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Crash report: HttpTransactCache::SelectFromAlternates
-----------------------------------------------------

                 Key: TS-1114
                 URL: https://issues.apache.org/jira/browse/TS-1114
             Project: Traffic Server
          Issue Type: Bug
            Reporter: Zhao Yongming


it may or may not be the upstream issue, let us open it for tracking.

{code}
#0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
    http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
{code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "John Plevyak (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13233161#comment-13233161 ] 

John Plevyak commented on TS-1114:
----------------------------------

This patch has been committed to 3.0.x and master branches.  Please verify and mark this "fixed" if your testing confirms that the problem is gone.
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Zhao Yongming (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zhao Yongming updated TS-1114:
------------------------------

    Fix Version/s:     (was: 3.1.5)
                   3.1.3

this patch runs perfect in our production for weeks
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.3
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Conan Wang (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13239294#comment-13239294 ] 

Conan Wang commented on TS-1114:
--------------------------------

get Hunk #1 FAILED at 1491 when try to backport, because TS-1084 also has a simple modify to the code.
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "weijin (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

weijin updated TS-1114:
-----------------------

    Attachment: cache_crash.diff

write_vector should be protected by vol mutex.
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Zhao Yongming (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13232196#comment-13232196 ] 

Zhao Yongming commented on TS-1114:
-----------------------------------

when we tracking down this issue, we have two directions: 
Weijin is tracking on why the event is "8", where there should not be any event that is "8" in the event system, and in other core dumps we are sure that the event is not what it should be as a really event, it is shown as a random data, that turns out to be something really interest: 1, it should be that the old data(may  or may not be the same event) is freed, and the event is not canceled. 2, someone overwrite the data in this event. Weijin track down this way and it turns out that the action cancel codes may rise some problem under certain situation. He made a patch into our tree, and we applied it on half of our servers, it runs without any crash for weeks.

At the same time, Koutai is working on make the vector write & read more safe, even in some very strange situation. And patched half of our servers, runs without any crash too.

after carefully discuss, we conclude that Weijing's patch is what we need to keep, and here comes the patch.

back to TS-857, when I look it back, there is some strange event in the back trace, we have only , is that the same issue hare? where is the action canceled without mutex protected? if we can consider TS-1114 a good fix, then we should think about TS-857 a crash same as it.

so far, I am not sure how many crashes after patched with TS-1114, I just don't get too much new back trace for this issue, TS-1114 may covered many strange crashes as it will make system really strange.
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "John Plevyak (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13232157#comment-13232157 ] 

John Plevyak commented on TS-1114:
----------------------------------

Gads, yes, the write_vector needs to be protected by the vol mutex.  This is a serious oversight.  Thanx for finding it.

This patch has to get in.  Do you want to commit it or do you want me to do a closer read then commit it? 
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Zhao Yongming (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210040#comment-13210040 ] 

Zhao Yongming commented on TS-1114:
-----------------------------------

{code}
(gdb) f	1
#1  0x0000000000644387 in CacheVC::openReadChooseWriter	(this=0x2aaab80ff400, event=8, e=<value optimized out>)	at CacheRead.cc:341
(gdb) p	vector
$19 = {magic = 0x0, data = {data = 0x2aaabcc8bc78, fast_data = {{alternate = {m_alt = 0x0}}, {alternate = {m_alt = 0x0}}, {alternate = {
          m_alt	= 0x0}}, {alternate = {m_alt = 0x0}}}, default_val = 0xe85a58, size = 8, pos = 7}, xcount = 8, vector_buf = {m_ptr = 0x0}}
(gdb)  
{code}

                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Leif Hedstrom (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Leif Hedstrom updated TS-1114:
------------------------------

    Fix Version/s: 3.1.3
    
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>             Fix For: 3.1.3
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Assigned] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "weijin (Assigned) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

weijin reassigned TS-1114:
--------------------------

    Assignee: weijin
    
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Zhao Yongming (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13232164#comment-13232164 ] 

Zhao Yongming commented on TS-1114:
-----------------------------------

yeah, we are confidential that we have fixed the crash, and we need your review, that is what we are waiting for :D
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "John Plevyak (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Plevyak updated TS-1114:
-----------------------------

    Backport to Version: 3.0.4
    
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Leif Hedstrom (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Leif Hedstrom updated TS-1114:
------------------------------

    Backport to Version: 3.0.5  (was: 3.0.4)
    
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "Leif Hedstrom (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Leif Hedstrom updated TS-1114:
------------------------------

    Fix Version/s:     (was: 3.1.3)
                   3.1.4
    
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TS-1114) Crash report: HttpTransactCache::SelectFromAlternates

Posted by "John Plevyak (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TS-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13233191#comment-13233191 ] 

John Plevyak commented on TS-1114:
----------------------------------

 Not in 3.0.x yet.  Need to get agreement on a backport.
                
> Crash report: HttpTransactCache::SelectFromAlternates
> -----------------------------------------------------
>
>                 Key: TS-1114
>                 URL: https://issues.apache.org/jira/browse/TS-1114
>             Project: Traffic Server
>          Issue Type: Bug
>            Reporter: Zhao Yongming
>            Assignee: weijin
>             Fix For: 3.1.4
>
>         Attachments: cache_crash.diff
>
>
> it may or may not be the upstream issue, let us open it for tracking.
> {code}
> #0  0x000000000053075e in HttpTransactCache::SelectFromAlternates (cache_vector=0x2aaab80ff500, client_request=0x2aaab80ff4c0, 
>     http_config_params=0x2aaab547b800) at ../../proxy/hdrs/HTTP.h:1375
> 1375	  ((int32_t *) & val)[0] = m_alt->m_object_key[0];
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira