You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by netguy204 <gi...@git.apache.org> on 2017/05/02 14:01:49 UTC

[GitHub] flink issue #3525: [FLINK-6020]add a random integer suffix to blob key to av...

Github user netguy204 commented on the issue:

    https://github.com/apache/flink/pull/3525
  
    +1 I'm looking forward to this fix as I think I'm encountering this bug in production.
    
    I bundle my jobs into a single JAR file with multiple mains. I submit the jobs to the cluster sequentially (once the cluster accepts one I submit the next). My job also has two dependency JARs that I provide via HTTP using the -C switch to flink.
    
    When a job fails it automatically restarts but it seems to cause other jobs from the same JAR to fail and restart as well. The error is always some variation of:
    
    ```
    java.lang.IllegalStateException: zip file closed
    	at java.util.zip.ZipFile.ensureOpen(ZipFile.java:669)
    	at java.util.zip.ZipFile.getEntry(ZipFile.java:309)
    	at java.util.jar.JarFile.getEntry(JarFile.java:240)
    	at sun.net.www.protocol.jar.URLJarFile.getEntry(URLJarFile.java:128)
    	at java.util.jar.JarFile.getJarEntry(JarFile.java:223)
    	at sun.misc.URLClassPath$JarLoader.getResource(URLClassPath.java:1005)
    	at sun.misc.URLClassPath$JarLoader.findResource(URLClassPath.java:983)
    	at sun.misc.URLClassPath.findResource(URLClassPath.java:188)
    	at java.net.URLClassLoader$2.run(URLClassLoader.java:569)
    	at java.net.URLClassLoader$2.run(URLClassLoader.java:567)
    	at java.security.AccessController.doPrivileged(Native Method)
    	at java.net.URLClassLoader.findResource(URLClassLoader.java:566)
    	at java.lang.ClassLoader.getResource(ClassLoader.java:1093)
    	at java.net.URLClassLoader.getResourceAsStream(URLClassLoader.java:232)
            .... backtrace from some arbitrary point in my code that never is doing anything with reflection ...
    ```
    
    The class load that triggers the fault is arbitrary. The same job may fail and restart multiple times in the same day with a different failing class load.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---