Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2020/09/17 17:23:01 UTC

[GitHub] [beam] TheNeuralBit commented on a change in pull request #12704: [BEAM-10603] Implement the new Large Source Recording API.

TheNeuralBit commented on a change in pull request #12704:
URL: https://github.com/apache/beam/pull/12704#discussion_r490431373



##########
File path: sdks/python/apache_beam/runners/interactive/display/pcoll_visualization.py
##########
@@ -407,13 +392,7 @@ def _display_dataframe(self, data, update=None):
           self._is_datatable_empty = False
 
   def _to_dataframe(self):
-    results = []
-    cache_manager = ie.current_env().get_cache_manager(self._pcoll.pipeline)
-    if cache_manager.exists('full', self._cache_key):
-      coder = cache_manager.load_pcoder('full', self._cache_key)
-      reader, _ = cache_manager.read('full', self._cache_key)
-      results = list(to_element_list(reader, coder, include_window_info=True))
-

Review comment:
       This looks like a likely culprit. The error message in the flakes references a `full` directory in the cache:
   
   ```
   E                   PermissionError: [WinError 32] The process cannot access the file because it is being used by another process: 'D:\\a\\beam\\beam\\sdks\\python\\target\\.tox\\py35-win\\tmp\\it-8vh2z7pi2021914046928\\full\\ac8879590f-2021876280456-2021876278608-2021914046928'
   ```
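
For context, here is a minimal sketch of the general failure mode behind `WinError 32`. It assumes (the PR does not confirm this) that a lazy reader still holds an open handle on a cache file when cleanup tries to delete it. `read_lines`, `demo`, and the file name are hypothetical and are not Beam APIs.

```python
import os
import tempfile


def read_lines(path, handles):
    """Hypothetical lazy reader: the file stays open until the
    generator is exhausted or explicitly closed."""
    with open(path) as f:
        handles.append(f)  # keep a reference so the demo can inspect it
        for line in f:
            yield line.rstrip("\n")


def demo():
    tmp_dir = tempfile.mkdtemp()
    path = os.path.join(tmp_dir, "full-cache-entry")
    with open(path, "w") as f:
        f.write("a\nb\nc\n")

    handles = []
    reader = read_lines(path, handles)
    first = next(reader)  # the file is opened here and stays open
    open_while_reading = not handles[0].closed

    # On Windows, os.remove(path) at this point would raise
    # PermissionError (WinError 32) because the handle is still open.
    reader.close()  # leaves the `with` block, releasing the handle
    closed_after = handles[0].closed

    os.remove(path)  # safe on every platform once the handle is released
    os.rmdir(tmp_dir)
    return first, open_while_reading, closed_after


if __name__ == "__main__":
    print(demo())
```

If something along these lines is happening in the tests, exhausting or closing every reader before the cache directory is cleaned up should avoid the error. That is a guess from the error message, not a confirmed diagnosis.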



