Remote Computing

MIT Page (near the end)

Often it is desirable to interact with a computer that isn’t sitting on your desk. Sometimes this means a server of some kind, sommetimes a powerful, shared computer used for high performance computations, or even an ordinary workstation to help troubleshoot.

Historically, this was done using a program called Telnet. While this protocol is still in use today, largely as a novelty, a similar, more secure protocol is now in use called Secure Shell or SSH.

As the name suggests, SSH is often used to establish a shell session on a remote machine. All of the things we have learned previously about using a shell are still valid, the execution just takes place on a different computer.

SSH is simple to invoke:

ssh george@lesica.com

The server to which we would like to connect, along with the username are specified on the command line. In the example above, the username is “george” and the server is “lesica.com”. By default, SSH uses port 22.

It is also possible to specify a command to be run instead of opening a full shell session. This is handy when you only wish to use the remote computer’s computational resources. Output from the command is delivered back to your local computer.

ssh george@lesica.com ls # Listing of george's home directory

Key-based Authentication

It is possible to avoid typing one’s password over and over again. We will walk through setting up key-based authentication in class and take a look at the example on the MIT course web page.

Configuration Options

In addition to key-based authentication, we can also set up “nicknames” for the servers we connect to most often. This is done using the file ~/.ssh/config (create it if it doesn’t exist already). This file can be used to set per-server options. An example is shown below.

Host missing
    HostName missing.cs.umt.edu
    User gl219598e
    IdentityFile ~/.ssh/id_rsa_missing

This is the entry in my config file for the server we will use in class. Note that I’ve nicknamed it “missing”, set the username, and pointed it at a key file. This allows me to connect using a simple command: ssh missing.

Copying Files (scp)

It is possible to copy files to any remote SSH host using the scp program. Since SCP (secure copy) uses the same configuration file, it can be made very easy to use.

scp file.txt missing:/home/gl219598e/

There is also a -r flag that copies directories recursively, making it easy to copy a number of files. If you have a lot of data, however, it might be better to use tar to compress the files beforehand.

# Use scp directly
scp -r myfiles missing:/home/gl219598e/

# Using tar
tar -cxf files.tar.gz myfiles/
scp files.tar.gz missing:/home/gl219598e/

Tunneling

SSH tunneling tutorial

It is possible to route your computer’s network traffic through another machine using SSH. This can be helpful if you need to access services running on the remote machine but not exposed to the Internet. It can also allow you to use the remote machine as a kind of “poor man’s” VPN (I used this technique to watch Hulu when I lived in Chile).

The MIT page has an example and some nice diagrams.

Tmux

A brief Tmux tutorial

A terminal multiplexer allows multiple shell sessions to be run and, more importantly for us, saved and restored. This capability is useful for running a persistent application on a remote server such that you can return to it later as though you’d only gotten up from the computer rather than logged out of it.

Tmux and Screen are the most popular terminal multiplexers. Install Tmux using Homebrew or your Linux distro’s package manager.

Leave a Program Running

What if you want to leave a program running on a server? This could be a long-lived computation, or a program that serves requests. We can do this with nohup, which causes a program to effectively ignore the “hangup” signal it receives when we close the SSH session.

nohup ./longjob.py George

We can also put this into the background either by appending & to the command or using our job control tools. Once we disconnect, the job remains. Note that stdout and stderr are automatically redirected to a file. This makes sense since we expect to “abandon” the program as it runs, but, presumably, we still want its output.

Once we log back in, however, we can’t bring the application back to the foreground, it doesn’t even show up in our jobs listing.

Using Tmux

We can accomplish something similar, but more conveniently, using Tmux. First, we start a Tmux session:

tmux new -s longjob

We can simply run tmux to get a new, anonymous, session too. Now we can run our program normally:

./longjob.py George

We could redirect the output or put the job in the background as well. But once we’re ready to log out we can “detach” from our terminal session with Ctrl-b d or by running tmux detach. Note that the program output stops printing, we’re no longer inside the shell session that has longjob.py running, so we don’t get its output.

Now we can disconnect from the server and come back later. If we run tmux ls we’ll see the longjob session listed. We can “attach” to it:

tmux attach -t longjob

We could also use tmux attach or tmux a since there is only one session. Now the old session is restored, and we can even see all the output that has occurred since we left!

Home | GitHub