Changeset 58314 in webkit

Timestamp:

Apr 27, 2010 10:50:03 AM

Author:

eric@webkit.org

Message:

2010-04-27 Eric Seidel <eric@webkit.org>

Reviewed by Adam Barth.

[chromium] new-run-webkit-tests hangs on Chromium Bots (OS X and Linux)
https://bugs.webkit.org/show_bug.cgi?id=37987

After further research, I believe the hang is caused by:
http://bugs.python.org/issue2320
Basically Popen() is not reentrant.
The workaround is to pass close_fds=True to Popen() on Mac/Linux.

I fixed our main Popen wrapper "Executive.run_command" to use close_fds=True
when appropriate.

I audited all places we call Popen() and either moved them to run_command
or left a FIXME that they are not thread safe. In a few places I added the
close_fds workaround directly and left an explanatory note.

Scripts/webkitpy/common/checkout/scm_unittest.py:
- Added note that this Popen use is not threadsafe.
Scripts/webkitpy/common/system/executive.py:
- Fixed our Executive.run_* to work around Python bug 2320.
Scripts/webkitpy/common/system/user.py:
- Added note that this Popen use is not threadsafe.
Scripts/webkitpy/layout_tests/layout_package/json_results_generator.py: ditto.
Scripts/webkitpy/layout_tests/port/apache_http_server.py: ditto.
Scripts/webkitpy/layout_tests/port/base.py:
- Change wdiff back to using run_command now that we believe it to be threadsafe.
Scripts/webkitpy/layout_tests/port/chromium.py:
- Fix to use Executive in places.
- Pass self._executive down to the Driver for easier unit testing.
Scripts/webkitpy/layout_tests/port/chromium_win.py:
- Re-factor to use a _kill_all method.
- Made the _kill_all method use run_command to be threadsafe.
Scripts/webkitpy/layout_tests/port/http_server.py:
- Add FIXME about using Executive.
Scripts/webkitpy/layout_tests/port/server_process.py:
- Use Executive to be threadsafe.
Scripts/webkitpy/layout_tests/port/webkit.py:
- Pass self._executive down to the Driver.
Scripts/webkitpy/layout_tests/port/websocket_server.py:
- Add note about Popen not being threadsafe.
Scripts/webkitpy/layout_tests/rebaseline_chromium_webkit_tests.py:
- Move one caller to run_command and add notes about moving the others.
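
The workaround described in this message can be sketched roughly as follows. This is a minimal Python 3 illustration under stated assumptions, not the webkitpy code: `should_close_fds` and `run_command` are stand-alone, hypothetical helpers that only borrow their names from the changeset. On POSIX, Python 3.2 and later already default `close_fds` to True, so passing the flag explicitly mattered most for the Python 2 interpreters in use at the time.

```python
import subprocess
import sys


def should_close_fds():
    # Mirrors the platform check in the changeset: skip close_fds on
    # Windows/Cygwin, where the hang was not observed.
    return sys.platform not in ('win32', 'cygwin')


def run_command(args, cwd=None):
    """Run args and return (exit_code, combined stdout/stderr as text)."""
    # Popen rejects non-string arguments such as ints, so coerce them first.
    args = [str(arg) for arg in args]
    process = subprocess.Popen(args,
                               stdout=subprocess.PIPE,
                               stderr=subprocess.STDOUT,
                               cwd=cwd,
                               close_fds=should_close_fds())
    output = process.communicate()[0]
    return process.returncode, output.decode('utf-8', 'replace')
```

A caller would use it as `code, out = run_command([sys.executable, '-c', 'print(6 * 7)'])`. Routing every child process through one wrapper like this is what lets the platform check live in a single place.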

Location:

trunk/WebKitTools

Files:

14 edited

ChangeLog (modified) (1 diff)
Scripts/webkitpy/common/checkout/scm_unittest.py (modified) (1 diff)
Scripts/webkitpy/common/system/executive.py (modified) (4 diffs)
Scripts/webkitpy/common/system/user.py (modified) (1 diff)
Scripts/webkitpy/layout_tests/layout_package/json_results_generator.py (modified) (2 diffs)
Scripts/webkitpy/layout_tests/port/apache_http_server.py (modified) (1 diff)
Scripts/webkitpy/layout_tests/port/base.py (modified) (2 diffs)
Scripts/webkitpy/layout_tests/port/chromium.py (modified) (9 diffs)
Scripts/webkitpy/layout_tests/port/chromium_win.py (modified) (2 diffs)
Scripts/webkitpy/layout_tests/port/http_server.py (modified) (1 diff)
Scripts/webkitpy/layout_tests/port/server_process.py (modified) (5 diffs)
Scripts/webkitpy/layout_tests/port/webkit.py (modified) (4 diffs)
Scripts/webkitpy/layout_tests/port/websocket_server.py (modified) (1 diff)
Scripts/webkitpy/layout_tests/rebaseline_chromium_webkit_tests.py (modified) (4 diffs)

trunk/WebKitTools/ChangeLog

-                      r58297
+                      r58314
+2010-04-27  Eric Seidel  <eric@webkit.org>
+        Reviewed by Adam Barth.
+        [chromium] new-run-webkit-tests hangs on Chromium Bots (OS X and Linux)
+        https://bugs.webkit.org/show_bug.cgi?id=37987
+        After further research, I believe the hang is caused by:
+        http://bugs.python.org/issue2320
+        Basically Popen() is not reentrant.
+        The workaround is to pass close_fds=True to Popen() on Mac/Linux.
+        I fixed our main Popen wrapper "Executive.run_command" to use close_fds=True
+        when appropriate.
+        I audited all places we call Popen() and either moved them to run_command
+        or left a FIXME that they are not thread safe.  A few places I added the
+        close_fds workaround there and left an explanitory note.
+        * Scripts/webkitpy/common/checkout/scm_unittest.py:
+         - Added note that this Popen use is not threadsafe.
+        * Scripts/webkitpy/common/system/executive.py:
+         - Fixed our Executive.run_* to workaround python bug 2320.
+        * Scripts/webkitpy/common/system/user.py:
+         _ Added note that this Popen use is not threadsafe.
+        * Scripts/webkitpy/layout_tests/layout_package/json_results_generator.py: ditto.
+        * Scripts/webkitpy/layout_tests/port/apache_http_server.py: ditto.
+        * Scripts/webkitpy/layout_tests/port/base.py:
+         - Change wdiff back to using run_command now that we believe it
+           to be threadsafe.
+        * Scripts/webkitpy/layout_tests/port/chromium.py:
+         - Fix to use Executive in places.
+         - Pass self._executive down to the Driver for easier unit testing.
+        * Scripts/webkitpy/layout_tests/port/chromium_win.py:
+         - Re-factor to use a _kill_all method.
+         - Made the _kill_all method use run_command to be threadsafe.
+        * Scripts/webkitpy/layout_tests/port/http_server.py:
+         - Add FIXME about using Executive.
+        * Scripts/webkitpy/layout_tests/port/server_process.py:
+         - Use Executive to be threadsafe.
+        * Scripts/webkitpy/layout_tests/port/webkit.py:
+         - Pass self._executive down to the Driver.
+        * Scripts/webkitpy/layout_tests/port/websocket_server.py:
+         - Add note about Popen not being threadsafe.
+        * Scripts/webkitpy/layout_tests/rebaseline_chromium_webkit_tests.py:
+         - Move one caller to run_command add notes about moving others.
 2010-04-27  Adam Barth  <abarth@webkit.org>

trunk/WebKitTools/Scripts/webkitpy/common/checkout/scm_unittest.py

-                      r58261
+                      r58314
 # Callers could use run_and_throw_if_fail(args, cwd=cwd, quiet=True)
 def run_silent(args, cwd=None):
+    # Note: Not thread safe: http://bugs.python.org/issue2320
     process = subprocess.Popen(args, stdout=subprocess.PIPE, stderr=subprocess.PIPE, cwd=cwd)
     process.communicate() # ignore output

trunk/WebKitTools/Scripts/webkitpy/common/system/executive.py

-                      r58036
+                      r58314
 class Executive(object):
+    def _should_close_fds(self):
+        # We need to pass close_fds=True to work around Python bug #2320
+        # (otherwise we can hang when we kill DumpRenderTree when we are running
+        # multiple threads). See http://bugs.python.org/issue2320 .
+        # Note that close_fds isn't supported on Windows, but this bug only
+        # shows up on Mac and Linux.
+        return sys.platform not in ('win32', 'cygwin')
     def _run_command_with_teed_output(self, args, teed_output):
         args = map(unicode, args)  # Popen will throw an exception if args are non-strings (like int())
         child_process = subprocess.Popen(args,
                                          stdout=subprocess.PIPE,
-                                         stderr=subprocess.STDOUT)
+                                         stderr=subprocess.STDOUT,
+                                         close_fds=self._should_close_fds())
         # Use our own custom wait loop because Popen ignores a tee'd
 …
     def _compute_stdin(self, input):
         """Returns (stdin, string_to_communicate)"""
+        # FIXME: We should be returning /dev/null for stdin
+        # or closing stdin after process creation to prevent
+        # child processes from getting input from the user.
         if not input:
             return (None, None)
 …
                     return_stderr=True,
                     decode_output=True):
+        """Popen wrapper for convenience and to work around python bugs."""
         args = map(unicode, args)  # Popen will throw an exception if args are non-strings (like int())
         stdin, string_to_communicate = self._compute_stdin(input)
 …
                                    stdout=subprocess.PIPE,
                                    stderr=stderr,
-                                   cwd=cwd)
+                                   cwd=cwd,
+                                   close_fds=self._should_close_fds())
         output = process.communicate(string_to_communicate)[0]
         # run_command automatically decodes to unicode() unless explicitly told not to.

trunk/WebKitTools/Scripts/webkitpy/common/system/user.py

-                      r56975
+                      r58314
         pager = os.environ.get("PAGER") or "less"
         try:
+            # Note: Not thread safe: http://bugs.python.org/issue2320
             child_process = subprocess.Popen([pager], stdin=subprocess.PIPE)
             child_process.communicate(input=message)

trunk/WebKitTools/Scripts/webkitpy/layout_tests/layout_package/json_results_generator.py

-                      r58036
+                      r58314
             results_file.close()
+    # FIXME: Callers should use scm.py instead.
     def _get_svn_revision(self, in_directory):
         """Returns the svn revision for the given directory.
 …
         """
         if os.path.exists(os.path.join(in_directory, '.svn')):
+            # Note: Not thread safe: http://bugs.python.org/issue2320
             output = subprocess.Popen(["svn", "info", "--xml"],
                                       cwd=in_directory,

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/apache_http_server.py

-                      r58036
+                      r58314
         # the sake of Window/Cygwin and it needs quoting that breaks
         # shell=False.
+        # FIXME: We should not need to be joining shell arguments into strings.
+        # shell=True is a trail of tears.
+        # Note: Not thread safe: http://bugs.python.org/issue2320
         self._httpd_proc = subprocess.Popen(self._start_cmd,
                                             stderr=subprocess.PIPE,

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/base.py

-                      r58279
+                      r58314
+# Python bug workaround.  See Port.wdiff_text() for an explanation.
+# Python's Popen has a bug that causes any pipes opened to a
+# process that can't be executed to be leaked.  Since this
+# code is specifically designed to tolerate exec failures
+# to gracefully handle cases where wdiff is not installed,
+# the bug results in a massive file descriptor leak. As a
+# workaround, if an exec failure is ever experienced for
+# wdiff, assume it's not available.  This will leak one
+# file descriptor but that's better than leaking each time
+# wdiff would be run.
+#
+# http://mail.python.org/pipermail/python-list/
+#    2008-August/505753.html
+# http://bugs.python.org/issue3210
 _wdiff_available = True
 _pretty_patch_available = True
 …
                actual_filename,
                expected_filename]
+        # FIXME: Why not just check os.exists(executable) once?
-        global _wdiff_available
+        global _wdiff_available  # See explaination at top of file.
         result = ''
         try:
-            # Python's Popen has a bug that causes any pipes opened to a
-            # process that can't be executed to be leaked.  Since this
-            # code is specifically designed to tolerate exec failures
-            # to gracefully handle cases where wdiff is not installed,
-            # the bug results in a massive file descriptor leak. As a
-            # workaround, if an exec failure is ever experienced for
-            # wdiff, assume it's not available.  This will leak one
-            # file descriptor but that's better than leaking each time
-            # wdiff would be run.
-            #
-            # http://mail.python.org/pipermail/python-list/
-            #    2008-August/505753.html
-            # http://bugs.python.org/issue3210
-            #
-            # It also has a threading bug, so we don't output wdiff if
-            # the Popen raises a ValueError.
-            # http://bugs.python.org/issue1236
             if _wdiff_available:
-                try:
-                    # FIXME: Use Executive() here.
-                    wdiff = subprocess.Popen(cmd,
-                        stdout=subprocess.PIPE).communicate()[0]
-                except ValueError, e:
-                    # Working around a race in Python 2.4's implementation
-                    # of Popen().
-                    wdiff = ''
+                wdiff = self._executive.run_command(cmd, decode_output=False)
                 wdiff = cgi.escape(wdiff)
                 wdiff = wdiff.replace('##WDIFF_DEL##', '<span class=del>')
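
The "assume it's not available after the first exec failure" workaround described in the comment block above can be sketched as below. This is an illustrative Python 3 version with hypothetical names (`wdiff_text` as a free function, taking the full command list), not the `Port.wdiff_text()` method itself. The exit code is deliberately ignored, on the assumption that a diff tool may exit non-zero merely because the inputs differ.

```python
import subprocess

# Flipped to False after the first failed exec, so the failure
# (and any descriptor it leaks) happens at most once per process.
_wdiff_available = True


def wdiff_text(cmd):
    """Return the tool's stdout as text, or '' if it cannot be executed."""
    global _wdiff_available
    if not _wdiff_available:
        return ''
    try:
        process = subprocess.Popen(cmd, stdout=subprocess.PIPE)
        output = process.communicate()[0]
    except OSError:
        # The executable is missing or not runnable: remember that
        # and stop trying for the rest of the run.
        _wdiff_available = False
        return ''
    return output.decode('utf-8', 'replace')
```

Once a launch has failed, every later call returns an empty string without spawning anything, which is the trade-off the original comment accepts.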

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/chromium.py

-                      r58279
+                      r58314
 import http_server
+from webkitpy.common.system.executive import Executive
 # FIXME: To use the DRT-based version of this file, we need to be able to
 # run the webkit code, which uses server_process, which requires UNIX-style
 …
     """Abstract base class for Chromium implementations of the Port class."""
-    def __init__(self, port_name=None, options=None):
-        base.Port.__init__(self, port_name, options)
+    def __init__(self, port_name=None, options=None, **kwargs):
+        base.Port.__init__(self, port_name, options, **kwargs)
         self._chromium_base_dir = None
 …
     def check_sys_deps(self, needs_http):
-        dump_render_tree_binary_path = self._path_to_driver()
-        proc = subprocess.Popen([dump_render_tree_binary_path,
-                                '--check-layout-test-sys-deps'])
-        if proc.wait():
+        cmd = [self._path_to_driver(), '--check-layout-test-sys-deps']
+        if self._executive.run_command(cmd, return_exit_code=True):
             _log.error('System dependencies check failed.')
             _log.error('To override, invoke with --nocheck-sys-deps')
 …
             webbrowser.open(uri, new=1)
         else:
+            # Note: Not thread safe: http://bugs.python.org/issue2320
             subprocess.Popen([self._path_to_driver(), uri])
 …
         """Starts a new Driver and returns a handle to it."""
         if self._options.use_drt:
-            return webkit.WebKitDriver(self, image_path, options)
-        return ChromiumDriver(self, image_path, options)
+            return webkit.WebKitDriver(self, image_path, options, exectuive=self._executive)
+        return ChromiumDriver(self, image_path, options, exectuive=self._executive)
     def start_helper(self):
 …
         if helper_path:
             _log.debug("Starting layout helper %s" % helper_path)
+            # Note: Not thread safe: http://bugs.python.org/issue2320
             self._helper = subprocess.Popen([helper_path],
                 stdin=subprocess.PIPE, stdout=subprocess.PIPE, stderr=None)
 …
     """Abstract interface for test_shell."""
-    def __init__(self, port, image_path, options):
+    def __init__(self, port, image_path, options, executive=Executive()):
         self._port = port
         self._configuration = port._options.configuration
 …
         self._options = options
         self._image_path = image_path
+        self._executive = executive
     def start(self):
 …
                     _log.warning('stopping test driver timed out, '
                                  'killing it')
-                    # FIXME: This should use Executive.
-                    null = open(os.devnull, "w")  # Does this need an encoding?
-                    subprocess.Popen(["kill", "-9",
-                                     str(self._proc.pid)], stderr=null)
-                    null.close()
+                    self._executive.kill_process(self._proc.pid)
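
Passing the executive into the driver is what makes it possible to substitute a fake in unit tests. A rough sketch of that idea follows; `RecordingExecutive` and the simplified `Executive` and `Driver` classes here are hypothetical stand-ins, not the webkitpy classes. The sketch uses `executive=None` and builds the default inside `__init__`, because a default written as `executive=Executive()` is evaluated once, when the function is defined, and that single object is then shared by every instance.

```python
import os
import signal


class Executive(object):
    """Stand-in for the real process helper (an assumption of this sketch)."""

    def kill_process(self, pid):
        os.kill(pid, signal.SIGTERM)


class RecordingExecutive(object):
    """Test double that records kill requests instead of sending signals."""

    def __init__(self):
        self.killed_pids = []

    def kill_process(self, pid):
        self.killed_pids.append(pid)


class Driver(object):
    def __init__(self, executive=None):
        # Create the default per instance rather than in the signature.
        self._executive = executive if executive is not None else Executive()
        self._pid = None

    def stop(self):
        # Delegate the kill so tests can observe it without real processes.
        if self._pid is not None:
            self._executive.kill_process(self._pid)
            self._pid = None
```

In a test, constructing `Driver(executive=RecordingExecutive())` lets the assertion inspect `killed_pids` afterwards, with no child process involved.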

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/chromium_win.py

-                      r58146
+                      r58314
                                             'wdiff.exe')
+    def _kill_all(self, process_name):
+        cmd = ['taskkill.exe', '/f', '/im', process_name]
+        self._executive.run_command(cmd)
     def _shut_down_http_server(self, server_pid):
         """Shut down the lighttpd web server. Blocks until it's fully
 …
             server_pid: The process ID of the running server.
         """
-        subprocess.Popen(('taskkill.exe', '/f', '/im', 'LightTPD.exe'),
-                        stdin=open(os.devnull, 'r'),
-                        stdout=subprocess.PIPE,
-                        stderr=subprocess.PIPE).wait()
-        subprocess.Popen(('taskkill.exe', '/f', '/im', 'httpd.exe'),
-                        stdin=open(os.devnull, 'r'),
-                        stdout=subprocess.PIPE,
-                        stderr=subprocess.PIPE).wait()
+        # FIXME: Why are we ignoring server_pid and calling
+        # _kill_all instead of Executive.kill_process(pid)?
+        self._kill_all("LightTPD.exe")
+        self._kill_all("httpd.exe")

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/http_server.py

-                      r58146
+                      r58314
             setup_mount = self._port_obj.path_from_chromium_base('third_party',
                 'cygwin', 'setup_mount.bat')
+            # FIXME: Should use Executive.run_command
             subprocess.Popen(setup_mount).wait()
         _log.debug('Starting http server')
+        # FIXME: Should use Executive.run_command
         self._process = subprocess.Popen(start_cmd, env=env)

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/server_process.py

-                      r58036
+                      r58314
 import time
+from webkitpy.common.system.executive import Executive
 _log = logging.getLogger("webkitpy.layout_tests.port.server_process")
 …
     as necessary to keep issuing commands."""
-    def __init__(self, port_obj, name, cmd, env=None):
+    def __init__(self, port_obj, name, cmd, env=None, executive=Executive()):
         self._port = port_obj
         self._name = name
 …
         self._env = env
         self._reset()
+        self._executive = executive
     def _reset(self):
 …
             raise ValueError("%s already running" % self._name)
         self._reset()
+        # close_fds is a workaround for http://bugs.python.org/issue2320
         close_fds = sys.platform not in ('win32', 'cygwin')
         self._proc = subprocess.Popen(self._cmd, stdin=subprocess.PIPE,
 …
                 _log.warning('stopping %s timed out, killing it' %
                              self._name)
-                # FIXME: This should use Executive.
-                null = open(os.devnull, "w")
-                subprocess.Popen(["kill", "-9",
-                                  str(self._proc.pid)], stderr=null)
-                null.close()
+                self._executive.kill_process(self._proc.pid)
                 _log.warning('killed')
         self._reset()
                 _log.warning('killed')
         self._reset()
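
The "wait for the server to stop, then kill it on timeout" pattern in this hunk can be sketched in modern Python as below. Note the substitution: this uses Python 3's `Popen.wait(timeout=...)` and `Popen.kill()` rather than the `Executive.kill_process` call the changeset introduces, and `stop_process` is a hypothetical helper name.

```python
import subprocess


def stop_process(proc, timeout_seconds=3.0):
    """Wait for proc to exit; kill it if it does not. Returns True if killed."""
    try:
        proc.wait(timeout=timeout_seconds)
        return False
    except subprocess.TimeoutExpired:
        # The process ignored the grace period, so terminate it forcibly.
        proc.kill()
        proc.wait()  # reap the child so it does not linger as a zombie
        return True
```

Because no extra `kill` child process is spawned, this variant does not itself go through `Popen()` on the timeout path.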

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/webkit.py

-                      r58279
+                      r58314
 import webbrowser
+from webkitpy.common.system.executive import Executive
 import webkitpy.common.system.ospath as ospath
 import webkitpy.layout_tests.port.base as base
 …
     """WebKit implementation of the Port class."""
-    def __init__(self, port_name=None, options=None):
-        base.Port.__init__(self, port_name, options)
+    def __init__(self, port_name=None, options=None, **kwargs):
+        base.Port.__init__(self, port_name, options, **kwargs)
         self._cached_build_root = None
         self._cached_apache_path = None
 …
     def create_driver(self, image_path, options):
-        return WebKitDriver(self, image_path, options)
+        return WebKitDriver(self, image_path, options, executive=self._executive)
     def test_base_platform_names(self):
 …
     """WebKit implementation of the DumpRenderTree interface."""
-    def __init__(self, port, image_path, driver_options):
+    def __init__(self, port, image_path, driver_options, executive=Executive()):
         self._port = port
         # FIXME: driver_options is never used.

trunk/WebKitTools/Scripts/webkitpy/layout_tests/port/websocket_server.py

-                      r58147
+                      r58314
         _log.debug('cmdline: %s' % ' '.join(start_cmd))
         # FIXME: We should direct this call through Executive for testing.
+        # Note: Not thread safe: http://bugs.python.org/issue2320
         self._process = subprocess.Popen(start_cmd,
                                          stdin=open(os.devnull, 'r'),

trunk/WebKitTools/Scripts/webkitpy/layout_tests/rebaseline_chromium_webkit_tests.py

-                      r58036
+                      r58314
 import zipfile
+from webkitpy.common.system.executive import run_command
 import port
 from layout_package import test_expectations
 …
     # Use a shell for subcommands on Windows to get a PATH search.
+    # FIXME: shell=True is a trail of tears, and should be removed.
     use_shell = sys.platform.startswith('win')
+    # Note: Not thread safe: http://bugs.python.org/issue2320
     p = subprocess.Popen(command, stdout=subprocess.PIPE,
                          stderr=subprocess.STDOUT, shell=use_shell)
 …
         return self._rebaselining_tests
+    # FIXME: Callers should use scm.py instead.
     def _get_repo_type(self):
         """Get the repository type that client is using."""
-        output, return_code = run_shell_with_return_code(['svn', 'info'],
-                                                         False)
+        return_code = run_command(['svn', 'info'], return_exit_code=True)
         if return_code == 0:
             return REPO_SVN
 …
             _log.info('No test was rebaselined so nothing to remove.')
+    # FIXME: Callers should move to SCM.add instead.
     def _svn_add(self, filename):
         """Add the file to SVN repository.
