Project

General

Profile

DDD raw frame file transfer » History » Version 18

Anchi Cheng, 07/11/2014 11:46 AM

1 1 Anchi Cheng
h1. DDD raw frame file transfer
2
3 3 Jim Pulokas
h2. Introduction:
4 2 Jim Pulokas
5 5 Jim Pulokas
Due to the limited disk space on the camera host computer, we wish to transfer the raw frame data to a location with more storage space.  The following assumes the camera host is a Windows system and the destination is a linux system.  For best performance, the linux system should be directly attached to the destination storage rather than over a network.  The "rawtransfer.py" script will run continuously in the background on the linux system, independent from Leginon.  It will periodically check the Leginon database to see if any new raw frames have been acquired by Leginon, then pull them over the network from the camera host.
6 2 Jim Pulokas
7 7 Anchi Cheng
In addition, the script changes the directory naming (DE) or stack file naming (K2 Summit) to reflect the integrated image filename transmitted to Leginon.  Therefore, even if you have network it so that the frames are directly saved to the file server, you should make modification of rawtransfer.py and use it to do this filename synchronization.
8
9 3 Jim Pulokas
h2. Initial setup of your systems:
10 2 Jim Pulokas
11 16 Anchi Cheng
*Camera host (Windows)*:  the folder where your raw frames are stored must be configured as a shared folder (without a password in our example).  Remember the share name, as you will need it when mounting from linux.  For more information about sharing on Windows 7, see http://windows.microsoft.com/en-us/windows/share-files-with-someone#1TC=windows-7
12 2 Jim Pulokas
13 6 Jim Pulokas
*Destination host (Linux)*:  The samba-client package must be installed in order to mount the shared Windows folder
14 1 Anchi Cheng
15 3 Jim Pulokas
h2. Mount the shared folder
16 1 Anchi Cheng
17 16 Anchi Cheng
From the linux host, mount the shared folder on the camera host with SAMBA.  For example, the windows system host name is "decamera" and the shared folder is  "D:\RawFrames\DE12\", but has a share name of "DE12".  The mount command would be:<pre>mount //decamera/DE12 /example_mount_point</pre>(replace example_mount_point with a directory you have created for this purpose).
18 1 Anchi Cheng
19 17 Anchi Cheng
h2. Create the folder to store the frames organizes by Leginon sessions.
20
21
See [[File Server Setup Considerations]] for the location saved in the database when the session was started.
22
23 18 Anchi Cheng
As an example, leginon.cfg specified image path to be /your_file_server_mount_point/whatever/frames/sessionname/rawdata/.  Your frame path for the session is then located at /your_file_server_mount_point/whatever/frames/sessionname/rawdata/.  The folder you need to create now is, therefore, /your_file_server_mount_point/whatever/frames.
24
25 3 Jim Pulokas
h2. Start rawtransfer script in the background
26 1 Anchi Cheng
27 8 Anchi Cheng
The script "rawtransfer.py" is provided in the leginon package, usually installed in your python site-packages folder (referred to as PYTHONSITEPKG below).  In the future, this script will be wrapped up in a proper init script such as you would find in /etc/init.d.  That would allow for automatically starting it when the system boots, but for now you must launch it manually.
28 1 Anchi Cheng
29 8 Anchi Cheng
*myami-2.2*
30 1 Anchi Cheng
The script will check the environment variable RAW_FRAMES_PATH for the location of the shared folder mount point.  The following command line will set that variable and then launch rawtransfer.py in the background:<pre>RAW_FRAMES_PATH=/example_mount_point nohup $PYTHONSITEPKG/leginon/rawtransfer.py > rawtransfer.log &</pre>The "nohup" ensures that it will continue running even when you log out or the terminal session is closed.
31
The example generates the log file "rawtransfer.log" in whatever directory you run this from.  There is no automatic log rotation, so you must occasionally check that this file does not grow out of control.
32
33 17 Anchi Cheng
*myami-3.0 and up*
34 8 Anchi Cheng
You need to pass in several options
35
<pre>
36
  --method=METHOD       method to transfer, e.g. --method=mv or --method=rsync
37
  --source_path=PATH    Mounted parent path to transfer, e.g. --source_path=/mnt/ddframes
38
  --camera_host=CAMERA_HOST   Camera computer hostname in leginondb, e.g. --camera_host=gatank2
39
  --destination_head=PATH  Specific head destination frame path to transfer if
40
                        multiple frame transfer is run for one source to frame paths not all mounted on the same computer, e.g. --destination_head=/data1
41
</pre>
42 12 Anchi Cheng
43 1 Anchi Cheng
For example,
44
<pre>
45 18 Anchi Cheng
nohup $PYTHONSITEPKG/leginon/rawtransfer.py --method=rsync --source_path=/example_mount_point --camera_host=scope1cam1 --destination_head=/your_file_server_mount_point/whatever > rawtransfer.log
46 1 Anchi Cheng
</pre>
47
48 13 Anchi Cheng
* mv method is used when the frame files are saved directly onto network drive.  No transfer is needed.  However, since the filename is only timestamped, this script will change the filenames and organize them by sessions.
49 1 Anchi Cheng
* In this version, files are saved under frames directory and organized by session below.   With the destination_head in the example, the resulting frames.st file will be in
50 13 Anchi Cheng
<pre>
51 18 Anchi Cheng
/your_file_server_mount_point/whatever/frames/your_session/rawdata/image_name.st
52 1 Anchi Cheng
</pre>
53 15 Anchi Cheng
* destination_head/frames directory should exist and writable before running this.
54 8 Anchi Cheng
* If you have another frame-saving camera on scop2cam1 then you start another on with different -source_path and --camera_host values.
55 12 Anchi Cheng
56 8 Anchi Cheng
57
The script is currently set up to check for new frames every 30 seconds.  If it finds new files, it will check the database to see if they are connected to a Leginon image.  If it is from Leginon, then it will rsync or move the folder to the rawdata folder for that session.  The name of the folder will be something like: 11nov28_00001sq_00001_hl_00001_en.frames (based on the same name as the Leginon mrc image).  Then it will remove the original folder.  It looks like it can copy one frame per second, but this could vary depending on network traffic.  Any frames folders found that are not connected to a Leginon image are not transferred and not removed, so you may want to periodically clean up those extra frames.