Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Tcp mode #636

Open
wants to merge 79 commits into
base: main
Choose a base branch
from
Open

Tcp mode #636

wants to merge 79 commits into from

Conversation

FlyingSamson
Copy link

@FlyingSamson FlyingSamson commented Jun 29, 2021

References

I could not find any explicit issue that would be solved by this PR. However, the requirement for TCP support has been mentioned in #282 and #184 (The letter also contains some comments regarding design decisions)

Code changes

  • As discussed in Multiple sources of LSP messages on frontend and backend #184 this PR firstly provides the switch from AsyncIO to AnyIO for the LanguageServerSession class and the LspStreamBase, LspStreamReader and LspStreamWriter classes (formerly LspStdIoBase, LspStdIoReader and LspStdIoWriter)
  • Second it does provide the option to connect to TCP Language Servers (tested with Wolfram-Language-Server, I'm happy to also provide my specification for it in another PR).
  • As there are now different ways to connect to a LS, the specification given in python_packages/jupyter_lsp/jupyter_lsp/schema/schema.json was extended by a mode property which may take on values stdio or tcp, stdio being the default if no mode is specified, so as to not break existing language server specifications. If mode==tcp, a LS is started in a new process on localhost and on a randomly selected free port. The port is controlled by replacing each occurrence of the placeholder {port} within the argv of the server's spec with that port.

Backwards-incompatible changes

There should be no backward-incompatible changes as the default values for the newly introduced mode property is stdio and thus no changes for the (currently solely used) language servers communicating over stdio is required.

TODOS:

  • Adopt the documentation regarding extension of jupyterlab-lsp by new language servers (-> now also possible to use tcp servers, explain new configuration parameters)
  • Split Session into StdioSession and TCPSession

Chores

  • linted
  • tested
  • documented
  • changelog entry

@bollwyvl
Copy link
Collaborator

Thank you so much for this! 😍 Despite all the ❌, this is a really good step forward.

I guess I'd like to see Session split into two implementations so there is just the one if mode == "tcp" check at the manager level. It is probably reasonable to either:

  • have the (mostly) battle-tested one be the base implementation, and have the new TCP one subclass it
  • make a new base class with a few easy-to-grok inner implementations in TCPSession and StdioSession

I wouldn't go too far overboard with abc's etc.

I'm only a casual user of anyio 2 (despite being tangentially to blame for its introduction into the jupyter stack), so i'll have to do a fair amount study to be able to help evaluate the specifics of this fully, but it does look like a good step forward. I'd really love we if we could get rid of explicit executors, and/or replace our gross, cargo-culted non-blocking win32 code with anyio constructs, but that is a whole separate ball of wax.

It looks like we will be able to test this fairly robustly with python-lsp, which supports --tcp, and indeed would need to crawl back up the coverage scale to feel good about it.

Giving it a pass through jlpm lint and python scripts/lint.py would also help clean things up a hair.

Solider on, and let us know if you get hard stuck anywhere!

@FlyingSamson
Copy link
Author

Thank you so much for taking the time to review my changes and your input! I hope I will find the time in the close future to fix everything.

Regarding anyio: I am not overly proud of the way I managed to get this working with anyio, especially the intersection where non async code (but still running within an existing event loop from jupyter lab) needs to call async code seems much to complicated for my taste. Even more so since the anyio change where start_blocking_portal now required a context manager which does not seem to play overly well with the object oriented world. So if you should find any better way to handle things, please let me know.

I will also try and split the Session class in two.

@FlyingSamson
Copy link
Author

While trying to get things running for externally running lsp-servers, I came across following problem (or so I think):

It seems that the current implementation never asks the server to shut down properly, i.e. no sequence shutdown→ Server, ack→Client, exit→ Server, seems to take place. I presume that was because each session was creating its own process running the required LS anyway, and after the work was done that process was just terminated. However, now that we would potentially like to support asking externally running servers for completions, we would have to close the connection more gracefully with accordance to the language server protocol, no?

So my question is: Am I right about this part of the protocol currently not being implemented? If so, I presume one would have to extend the ClientRequest and ClientNotification enums in connection.ts. But I have no clue as to how I would proceed from there as I have zero knowledge of javascript programming. Do you have some pointers how I would go from there?

@krassowski
Copy link
Member

krassowski commented Jul 3, 2021

So my question is: Am I right about this part of the protocol currently not being implemented?

Yes. I am not sure when we want to ask servers to close, but at the minimum we need to do so at JupyterLab shutdown, which is also the safest option (but not trivial to detect). It is tricky because user closing the window does not mean that the session ends - they may be just switching to another browser or refreshing the window. So we would probably need to have a hook in the python code using atexit. Shutting the language server down when all documents for given language are closed is an option, but this requires investigation of how that would work for multi-user deployments.

Having said that, there is also a "shut down" option in JupyterLab's "File" menu, though I never use it (I always exit by terminating the server and closing the window) as I think many users do, hence the remark about the need of having atexit hook.

If so, I presume one would have to extend the ClientRequest and ClientNotification enums in connection.ts

Yes, if we want to emit those from within the frontend (which as described above is complicated and could only reliably happen if and only if the "Shut down" button was pressed in JupyterLab, as any other server disconnect might indicate a temporary network failure, etc...). The next step would be to update IClientNotifyParams and IClientRequestParams, but again I am not convinced whether we want to have this handled by the frontend, or rather by the python extension instead.

We also need to investigate how this would interact with multi-user deployments such as JupyterHub and Real Time Collaboration. In short I do not think that we should implement this in this PR because there are too many unknowns; you are right that we should look into it and thank you for starting this discussion. I think we should create a new issue out of this to allow others to chime in with ideas before deciding on implementation.

Edit: there is a stop_extension hook in works in jupyter_server and we should probably use that over atexit: jupyter-server/jupyter_server#526

@FlyingSamson
Copy link
Author

All right, thanks for clarifying. I will focus on implementing the TCP support for self launched processes for now then and open a separate issue to discuss on how to proceed from this stage, when this PR is done.

@krassowski krassowski added this to the 3.9 milestone Jul 4, 2021
@krassowski
Copy link
Member

Picked up conflicts with schema. Sadly GitHub Actions do not kick in until conflicts are resolved. I also added a note about stop_extension hook in my previous comment.

@FlyingSamson
Copy link
Author

Picked up conflicts with schema. Sadly GitHub Actions do not kick in until conflicts are resolved. I also added a note about stop_extension hook in my previous comment.

All right, so I can just pull master and merge it into the tcp_mode branch, thereby resolving the conflict, and then push again, correct?

@bollwyvl
Copy link
Collaborator

bollwyvl commented Jul 8, 2021

pull master and merge it into the tcp_mode branch,

or do it through the web ui if you're feeling brave...

@FlyingSamson FlyingSamson marked this pull request as ready for review July 6, 2022 16:05
@FlyingSamson
Copy link
Author

I think the code should be ready for another review round (I don't think that the smoke installation error was caused by the made changes).

To fully take advantage of the structured concurrency offered by anyio, I think more work needs to be done from the jupyterlab extension API side, as the current implementation using Tornado Websockets, while supporting coroutines for open() and on_message(), does not seem to be made for structured concurrency. E.g., the doc of open() states that no messages will be processed until open() has returned, thus preventing any structured concurrency happening within without using another event loop in a separate thread.

@krassowski
Copy link
Member

Looking at the issue of freezes everywhere except for Ubuntu py3.10 on; there is no sane way to debug it on CI/with pytest, but locally I can reproduce it. Starting lab I see:

Exception while listening Cannot add child handler, the child watcher does not have a loop attached
    Traceback (most recent call last):
      File "jupyterlab-lsp/python_packages/jupyter_lsp/jupyter_lsp/session.py", line 162, in run
        await self.initialize()
      File "jupyterlab-lsp/python_packages/jupyter_lsp/jupyter_lsp/session.py", line 180, in initialize
        await self.init_process()
      File "jupyterlab-lsp/python_packages/jupyter_lsp/jupyter_lsp/session.py", line 328, in init_process
        await self.start_process(self.spec["argv"])
      File "jupyterlab-lsp/python_packages/jupyter_lsp/jupyter_lsp/session.py", line 230, in start_process
        env=self.substitute_env(self.spec.get("env", {}), os.environ),
      File ".pyenv/versions/3.7.15/envs/lsp-lab3.x-py3.7/lib/python3.7/site-packages/anyio/_core/_subprocesses.py", line 135, in open_process
        start_new_session=start_new_session,
      File ".pyenv/versions/3.7.15/envs/lsp-lab3.x-py3.7/lib/python3.7/site-packages/anyio/_backends/_asyncio.py", line 1112, in open_process
        start_new_session=start_new_session,
      File ".pyenv/versions/3.7.15/lib/python3.7/asyncio/subprocess.py", line 217, in create_subprocess_exec
        stderr=stderr, **kwds)
      File ".pyenv/versions/3.7.15/lib/python3.7/asyncio/base_events.py", line 1544, in subprocess_exec
        bufsize, **kwargs)
      File ".pyenv/versions/3.7.15/lib/python3.7/asyncio/unix_events.py", line 193, in _make_subprocess_transport
        self._child_watcher_callback, transp)
      File ".pyenv/versions/3.7.15/lib/python3.7/asyncio/unix_events.py", line 941, in add_child_handler
        "Cannot add child handler, "
    RuntimeError: Cannot add child handler, the child watcher does not have a loop attached

Coming from self.process = await anyio.open_process() call.

@krassowski
Copy link
Member

krassowski commented Jan 1, 2023

This appears to be https://bugs.python.org/issue35621 which was only fixed in Python 3.8 as the workaround from https://bugs.python.org/msg370355:

import asyncio
from .threaded_child_watcher import ThreadedChildWatcher   # where ThreadedChildWatcher was copied from CPython
asyncio.set_child_watcher(ThreadedChildWatcher())

does work. This is for Unix only, no idea how to fix this on Windows, and the workaround may have side-effects on tornado.

Switching to trio backend gives us:

RuntimeError: There is no current event loop in thread 'Thread-2'.

@krassowski
Copy link
Member

I guess the question becomes: do we really need/want start_process to by async?

@krassowski
Copy link
Member

So things are getting interesting with krassowski@12bfbd4:

  • Windows py3.10 works on CI (which gives me hope) but throws ValueError: I/O operation on closed pipe from time to time in tests which is a known problem and apparently can be worked around by calling private process._transport.close(). Probably worth finding a bpo or filing an issue with cPython.
  • Ubuntu py3.7 starts up locally but hangs when running tests (both locally and on CI). When manually aborting the tests we can see that it raises:
Traceback (most recent call last):
  File "/3.7.15/lib/python3.7/threading.py", line 926, in _bootstrap_inner
    self.run()
  File "/3.7.15/lib/python3.7/threading.py", line 870, in run
    self._target(*self._args, **self._kwargs)
  File "/lsp-lab3.x-py3.7/lib/python3.7/site-packages/anyio/_core/_eventloop.py", line 70, in run
    return asynclib.run(func, *args, **backend_options)
  File "/lsp-lab3.x-py3.7/lib/python3.7/site-packages/anyio/_backends/_asyncio.py", line 292, in run
    return native_run(wrapper(), debug=debug)
  File "/3.7.15/lib/python3.7/asyncio/runners.py", line 43, in run
    return loop.run_until_complete(main)
  File "/3.7.15/lib/python3.7/asyncio/base_events.py", line 587, in run_until_complete
    return future.result()
  File "/lsp-lab3.x-py3.7/lib/python3.7/site-packages/anyio/_backends/_asyncio.py", line 287, in wrapper
    return await func(*args)
  File "jupyterlab-lsp/python_packages/jupyter_lsp/jupyter_lsp/session.py", line 179, in run
    await self.cleanup()
  File "jupyterlab-lsp/python_packages/jupyter_lsp/jupyter_lsp/session.py", line 205, in cleanup
    await self.stop_process(self.stop_timeout)
  File "jupyterlab-lsp/python_packages/jupyter_lsp/jupyter_lsp/session.py", line 254, in stop_process
    self.process.terminate()
  File "/lsp-lab3.x-py3.7/lib/python3.7/site-packages/anyio/_backends/_asyncio.py", line 1053, in terminate
    self._process.terminate()
  File "/3.7.15/lib/python3.7/asyncio/subprocess.py", line 131, in terminate
    self._transport.terminate()
  File "/3.7.15/lib/python3.7/asyncio/base_subprocess.py", line 150, in terminate
    self._check_proc()
  File "/3.7.15/lib/python3.7/asyncio/base_subprocess.py", line 143, in _check_proc
    raise ProcessLookupError()
ProcessLookupError

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants