Neil Voss, 07/21/2010 12:31 PM

h1. Annotated Dataset of Images of Keyhole Limpet Hemocyanin Particles I

h2. 1. Imaging Conditions

Images were acquired in defocus pairs on a Philips CM200 TEM equipped with a Tietz 2K x 2K CCD camera, at a nominal magnification of 66,000x and an accelerating voltage of 120 kV, using the Leginon system (Potter et al., 1999; Carragher et al., 2000). The first image of each pair (named *.001.mrc) was acquired at near-to-focus (NTF) conditions (e.g., -1 µm) and the second (named *.002.mrc) at farther-from-focus (FFF) conditions (e.g., -3 µm). The interval between the two exposures was approximately 20 s, the time required to read out the digital image from the camera. At this magnification, the pixel size is 2.2 Å at the specimen scale, and the accumulated dose over the high magnification image area was about 10 e/Å². Figure 1 shows an example defocus pair. Click either image to view it at full size.

p=. (a) !fig1a-small.jpg!:http://emg.nysbc.org/prtl_data/klh/klh_1k/fig1a.jpg           (b) !fig1b-small.jpg!:http://emg.nysbc.org/prtl_data/klh/klh_1k/fig1b.jpg
(a) Near-to-focus (NTF) image. (b) Far-from-focus (FFF) image.

p=. *Figure 1: An example pair of high magnification images of KLH.*

Using a defocus pair of images has at least two major advantages. First, combining the two images of the pair gives relatively high contrast at both low and high spatial frequencies. Second, the moderately strong low-resolution signal in the FFF images makes it possible to develop algorithms that identify particles automatically. The idea of using a defocus pair of images has been explored by several other researchers.

h2. 2. Downloading High Magnification Images

Image files are in "MRC":http://emg.nysbc.org/prtl_data/mrc_specification.htm format. For convenient viewing, we also provide a JPEG file for each image. Use one of the following options to download the set of high magnification images.

* Download only far-from-focus (FFF) images: "MRC files":http://emg.nysbc.org/prtl_data/klh/klh_1k/exposure2.mrc.tar.gz (384 MB), "JPEG files":http://emg.nysbc.org/prtl_data/klh/klh_1k/exposure2.jpg.tar (94 MB).
* Download only near-to-focus (NTF) images: "MRC files":http://emg.nysbc.org/prtl_data/klh/klh_1k/exposure1.mrc.tar.gz (383 MB), "JPEG files":http://emg.nysbc.org/prtl_data/klh/klh_1k/exposure1.jpg.tar (94 MB).

If you need another format, the freely available tool "em2em":http://www.imagescience.de/em2em may be able to convert the MRC files.
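
Before converting a downloaded file, it can help to inspect its MRC header. The following is a minimal sketch, assuming the common MRC layout: a 1024-byte header whose first four 32-bit integers are nx, ny, nz and the data mode. Little-endian byte order is also an assumption and may not hold for these files. The function names are hypothetical, and the linked MRC specification remains the authority on the format.

```python
import struct

# Bytes per pixel for common MRC data modes (an assumed subset):
# 0 = int8, 1 = int16, 2 = float32, 6 = uint16.
MODE_BYTES = {0: 1, 1: 2, 2: 4, 6: 2}
HEADER_SIZE = 1024  # fixed size of the main MRC header


def read_mrc_header(data, byteorder="<"):
    """Return (nx, ny, nz, mode) parsed from the first bytes of an MRC file.

    byteorder is "<" (little-endian, assumed default) or ">" (big-endian).
    """
    if len(data) < HEADER_SIZE:
        raise ValueError("MRC header must be at least 1024 bytes")
    nx, ny, nz, mode = struct.unpack(byteorder + "4i", data[:16])
    if mode not in MODE_BYTES:
        raise ValueError("unexpected mode %d; try the other byte order" % mode)
    return nx, ny, nz, mode


def expected_data_bytes(nx, ny, nz, mode):
    """Size in bytes of the pixel data implied by the header fields."""
    return nx * ny * nz * MODE_BYTES[mode]
```

Passing the first 1024 bytes of a file to read_mrc_header should return the image dimensions. If the mode is rejected, retrying with byteorder=">" is a reasonable next step.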

h2. 3. Positions of the Picked Particles in the Images

Since the NTF image in a defocus pair covers almost the same specimen area as the FFF image, the relative distances between particles in the NTF image should be the same as those in the FFF image. Using phase correlation, we can accurately align the NTF image to the FFF image in a defocus pair (Zhu et al., 2001). Particles in the NTF image can therefore be extracted using the positions of the particles identified in the FFF image, shifted according to the alignment result. Although only the particles selected in NTF images are passed to the later reconstruction stage, we provide below only the positions of particles picked, manually or automatically, in the FFF images.
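
The alignment step described above can be illustrated with a small, dependency-free sketch of phase correlation. It is not the implementation used for this dataset. It assumes a purely cyclic translation between the two images and uses a naive DFT that is only practical for tiny arrays; real micrographs would need an FFT library and some handling of noise and image borders. All function names and the sign convention are hypothetical.

```python
import cmath


def dft(seq, inverse=False):
    """Naive O(n^2) discrete Fourier transform of a 1-D sequence."""
    n = len(seq)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        total = sum(seq[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                    for t in range(n))
        out.append(total / n if inverse else total)
    return out


def dft2(img, inverse=False):
    """2-D DFT: transform every row, then every column."""
    rows = [dft(row, inverse) for row in img]
    cols = [dft(list(col), inverse) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]


def phase_correlation_shift(ref, moved):
    """Estimate (dy, dx) such that moved[y][x] ~ ref[y - dy][x - dx] (cyclic)."""
    f_ref, f_mov = dft2(ref), dft2(moved)
    cross = []
    for row_r, row_m in zip(f_ref, f_mov):
        out_row = []
        for a, b in zip(row_r, row_m):
            c = b * a.conjugate()
            # Keep only the phase; skip frequencies with no signal.
            out_row.append(c / abs(c) if abs(c) > 1e-12 else 0j)
        cross.append(out_row)
    corr = dft2(cross, inverse=True)
    h, w = len(ref), len(ref[0])
    _, dy, dx = max((corr[y][x].real, y, x)
                    for y in range(h) for x in range(w))
    # Peaks past the midpoint correspond to negative shifts.
    if dy > h // 2:
        dy -= h
    if dx > w // 2:
        dx -= w
    return dy, dx


def shift_picks(picks, dy, dx):
    """Map (x, y) particle positions from the reference frame to the moved frame."""
    return [(x + dx, y + dy) for x, y in picks]
```

With the FFF image as the reference and the NTF image as the moved image, the shift returned by phase_correlation_shift can be passed to shift_picks to carry FFF particle positions over to the NTF image.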

p=. (a) !fig02a-small.jpg!:http://emg.nysbc.org/prtl_data/klh/klh_1k/fig02a.jpg           (b) !fig02b-small.jpg!:http://emg.nysbc.org/prtl_data/klh/klh_1k/fig02b.jpg
(a) The NTF image. (b) The FFF image.

p=. *Figure 2: An example pair of images overlaid with the particles automatically picked by Selexon. Each "+" indicates a detected particle.*

As mentioned in the introduction, particle picking is an open, unresolved problem. Even among biological experts, the final picks vary from person to person. We therefore post here more than one set of manual or automated picks. Besides the particles picked by our own program, Selexon, we will also post other automated picks, such as those generated by Spider, EMAN, etc., as they become available. For each set of picked particles, we give a brief description of the criteria for manual picks or of the algorithm for automated picks. Links to more detailed descriptions will be provided when available.

p=. *Table 1: Positions of Manually Picked Particles.*

|*Picker*|*Particle Picking Criteria in Picker's words*|*Links to download*|
|Fabrice Mouche|The KLH didecamer presents two main orientations, a rectangular sideview and a circular topview. A cryoelectron microscopy field also shows the presence of intermediate views, of broken molecules and of aggregates of two or more particles. From the 82 images obtained with the CCD camera, 1042 single particles were manually and interactively extracted, using SPIDER and WEB (Frank et al., 1996). Only rectangular sideviews and intermediate orientations were selected with a percentage of 95% and 5%, respectively. No aggregate or "single" particle showing a different length, shorter or longer, was manually picked. Furthermore, to avoid any reconstruction artifact due to an overabundant type of views (Boisset et al., 1998), no circular topview was selected. The presence of a D5 point group symmetry and its application during the reconstruction procedure were sufficient to avoid any lack of structure information, especially along the Z-axis.|"Coordinates":http://emg.nysbc.org/prtl_data/klh/klh_1k/mouche_pik.tar of particles in the FFF images.|

p=. *Table 2: Positions of Automatically Picked Particles.*

|*Picker*|*Algorithm Description*|*Links to download*|
|Selexon|An edge-based computational approach for automatic particle detection. Under this framework, the Canny edge detector (Canny, 1986) is first applied to cryo-EM images. Then, a sequence of ordered Hough transforms (HTs) is either developed or adopted to detect particle contours in the edge images. The HTs are applied in order from the computationally simplest to the most complex. Edge elements covered by the detected shapes are removed from the edge image immediately after each HT is applied, so the next HT operates on an edge image equivalent to one that never contained the detected shapes. This not only lessens the effect of noisy edge elements on subsequent HTs but also significantly reduces the total computational complexity. For picking hemocyanin particles, a sequence of two types of HTs is necessary. First, circular particles are detected using a fast implementation of the Hough transform for circles, and edges covered by the detected circular regions are removed immediately; then a rectangular Hough transform is used to extract the approximately rectangular particles (Zhu et al., 2003).|"Coordinates":http://emg.nysbc.org/prtl_data/klh/klh_1k/selexon.pik.tar of particles in the FFF images.|
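
The detect-then-remove idea in the Selexon description can be sketched with a plain circle Hough transform. This is not the fast implementation or the rectangular transform from Zhu et al. (2003). It assumes the circle radius is known in advance and that edge points are already available as (x, y) pixel coordinates. The function names, the number of angular steps and the removal tolerance are all hypothetical.

```python
import math
from collections import Counter


def hough_circle_center(edge_points, radius, n_angles=360):
    """Vote for centers of circles with a known radius.

    Returns ((cx, cy), votes) for the candidate center with the most votes.
    """
    votes = Counter()
    for x, y in edge_points:
        seen = set()  # each edge point votes at most once per candidate center
        for i in range(n_angles):
            theta = 2 * math.pi * i / n_angles
            cx = round(x - radius * math.cos(theta))
            cy = round(y - radius * math.sin(theta))
            seen.add((cx, cy))
        votes.update(seen)
    return votes.most_common(1)[0]


def remove_covered_edges(edge_points, center, radius, tol=1.5):
    """Drop edge points lying on the detected circle before the next transform."""
    cx, cy = center
    return [(x, y) for x, y in edge_points
            if abs(math.hypot(x - cx, y - cy) - radius) > tol]
```

After a circle is detected, remove_covered_edges returns the edge points that a later, more expensive transform would still need to consider.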

We have built a Tcl script that compares one person's or machine's picks against another's, treating the first set as the ground truth. Using this tool, we built a confusion matrix for the current manual and automated picks, shown in Table 3.

p=. *Table 3: Confusion matrix obtained when comparing one pick against the others.*

|*Truth\Test*|*Fabrice Mouche*|*Selexon*|
|*Fabrice Mouche*||FNR: 9.7% and FPR: 13.7%|
|*Selexon*|FNR: 13.7% and FPR: 9.7%||

_Note: FNR represents false negative rate; FPR represents false positive rate._
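
The Tcl script itself is not shown here, so the following Python sketch only illustrates one way such a comparison could work. It assumes that a test pick matches a truth pick when the two lie within a distance tolerance, that each truth pick is matched at most once, that FNR is the fraction of truth picks left unmatched, and that FPR is the fraction of test picks left unmatched. The function name, the tolerance value and these definitions are assumptions, not the authors' method.

```python
import math


def compare_picks(truth, test, tol=20.0):
    """Greedily match test picks to truth picks within tol pixels.

    Returns (fnr, fpr) under the assumed definitions:
    fnr = unmatched truth picks / number of truth picks,
    fpr = unmatched test picks / number of test picks.
    """
    if not truth or not test:
        raise ValueError("both pick lists must be non-empty")
    unmatched_truth = list(truth)
    matched = 0
    for tx, ty in test:
        best, best_dist = None, tol
        for i, (x, y) in enumerate(unmatched_truth):
            d = math.hypot(tx - x, ty - y)
            if d <= best_dist:
                best, best_dist = i, d
        if best is not None:
            unmatched_truth.pop(best)  # each truth pick is matched at most once
            matched += 1
    fnr = (len(truth) - matched) / len(truth)
    fpr = (len(test) - matched) / len(test)
    return fnr, fpr
```

Under these definitions, swapping the truth and test sets swaps the two rates whenever the same pairs are matched, which agrees with the symmetry visible in Table 3.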

h2. 4. Sample 3D Reconstructions and a Preliminary 3D Map

p=. *Table 4: Sample 3D reconstructions generated using particles selected either manually or automatically.*

|*Picker*|*Three-dimensional Density Map*|*Description of Reconstruction Procedures*|*Comments*|
|Fabrice Mouche|!fabrice2.jpg! !fabrice1.jpg!|A D5 point-group symmetry was imposed. The series of 1042 particles was subjected to three cycles of 3D projection alignment, using a previous volume as a reference, and a new reconstruction volume was calculated.|"Coordinates":http://emg.nysbc.org/prtl_data/klh/klh_1k/mouche_pik_into_map.tar of particles that went into the map.|
|Selexon|!yuanxin2.jpg! !yuanxin1.jpg!|A D5 point-group symmetry was imposed. The series of 1243 particles was subjected to three cycles of 3D projection alignment, using a previous volume as a reference, and a new reconstruction volume was calculated.|None.|

*Note: A preliminary 3-D map of the particle is also available in two different formats: "MRC format":http://emg.nysbc.org/prtl_data/klh/klh_1k/klh_map.mrc.gz and "SPIDER format":http://emg.nysbc.org/prtl_data/klh/klh_1k/klh_map.spi.gz.* (The map files are about 46 MB in size after gzip compression.)

h2. 5. References

# Boisset, N., et al. (1998) Overabundant single-particle electron microscope views induce a three-dimensional reconstruction artifact. Ultramicroscopy *74*: 201-207.
# Canny, J. (1986) A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. *8*: 679-698.
# Carragher, B., Kisseberth, N., Kriegman, D., Milligan, R. A., Potter, C. S., Pulokas, J., and Reilein, A. (2000) Leginon: An automated system for acquisition of images from vitreous ice specimens. J. Struct. Biol. *132*: 33-45.
# Frank, J., et al. (1996) SPIDER and WEB: processing and visualization of images in 3D electron microscopy and related fields. J. Struct. Biol. *116(1)*: 190-199.
# Potter, C. S., Chu, H., Frey, B., Green, C., Kisseberth, N., Madden, T. J., Miller, K. L., Nahrstedt, K., Pulokas, J., Reilein, A., Tcheng, D., Weber, D., and Carragher, B. (1999) Leginon: A system for fully automated acquisition of 1000 micrographs a day. Ultramicroscopy *77*: 153-161.
# Zhu, Y., Carragher, B., Kriegman, D., Milligan, R., and Potter, C. (2001) Automated identification of filaments in cryo-electron microscopy images. J. Struct. Biol. *135*: 302-312.
# Zhu, Y., Carragher, B., and Potter, C. S. (2003) Automatic particle detection through efficient Hough transforms. IEEE Transactions on Medical Imaging *22(9)*: 1053-1062.