BASIC BIOINFORMATICS

1.Who coined the term Bioinformatics and when?

a. Robert Ledley, 1978

b. David W Mount, 1977

c. Paulien Hogeweg, 1979

d. Dr Margaret Oakley Dayhoff, 1976

Correct Answer: c

2.GenBank, the Nucleic acid sequence database is maintained by

a. DNA database of Japan

b. European Molecular Biology Laboratory

c. National Centre for Biotechnology Information

d. Brookhaven laboratory

Correct Answer: c

3.Each record in a database is called

a. Query

b. Entry

c. File

d. Record

Correct Answer: b

4.All of the following are protein sequence databases except

a. SWISSPROT

b. EMBL

c. PIR

d. PDB

Correct Answer: b

5.Which of the following is not a primary nucleic acid database?

a. GenBank

b. DDBJ

c. EMBL

d. TrEMBL

Correct Answer: d

6.Margaret Dayhoff developed the first protein sequence database called

a. UniPROT

b. Atlas of sequence and structure protein

c. SWISS PROT

d. Protein sequence databank

Correct Answer: b

7.The structural classification of proteins is based primarily on their

a. Amino acid sequence

b. Evolutionary relationships

c. function

d. Secondary structure content and arrangement

Correct Answer: d

8.In the α helix the hydrogen bonds

a. occur only near the amino and carboxyl termini of the helix

b. are roughly parallel to the axis of the helix

c. are roughly perpendicular to the axis of the helix

d. occur only between some of the amino acids of the helix

Correct Answer: b

9.Which of the following is a RefSeq accession number corresponding to an mRNA?

a. NP_52280

b. AAB134506

c. NM_15392

d. J01536

Correct Answer: c

10.You have a favorite gene, and you want to determine in what tissues it is expressed. Which one of the following resources is likely the most direct route to this information?

a. Entrez

b. Pubmed

c. PCR

d. UniGene

Correct Answer: d

11.Which of the following databases is derived from mRNA information?

a. HTGS

b. dbEST

c. PBD

d. OMIM

Correct Answer: b

12.Which of the following databases can be used to access text information about human diseases?

a. PBD

b. OMIM

c. OMIM

d. EST

Correct Answer: c

13.What is the difference between RefSeq and GenBank?

a. RefSeq includes publicly available DNA sequences

submitted from individual laboratories and sequencing

projects

b. GenBank provides nonredundant curated data

c. GenBank sequences are derived from RefSeq.

d. RefSeq sequences are derived from GenBank and provide

nonredundant curated data.

Correct Answer: d

14.Orthologs are defined as

a. Homologous sequences in the same species that arose

through gene duplication

b. Homologous sequences in the same species which have

similar and often redundant functions

c. Homologous sequences in different species that share an ancestral gene

d. Homologous sequences that share little amino acid identity but share great structural similarity

Correct Answer: c

15.A single piece of information in a database is called

a. Dataset

b. Field

c. File

d. Record

Correct Answer: b

16.Literature databases include

a. Pubmed and PDB

b. MEDLINE and PDS

c. MEDLINE and PUBMED

d. MEDLINE and PDB

Correct Answer: c

17.Which of the following is false about the ‘loop’ structure in proteins?

a. They connect helices and sheets

b. They are more flexible and can adopt multiple conformations

c. They are more tolerant of mutations

d. They are never the components of active sites

Correct Answer: d

18.Approximately how large is the human genome?

a. 3000Mb

b. 3Mb

c. 300Mb

d. 30,000Mb

Correct Answer: a

19.You have a reasonably short, typical, double-stranded DNA sequence.Basically, how many proteins can it potentially encode?

a. 6

b. 1

c. 2

d. 3

Correct Answer: a

20.Approximately what percentage of the human genome consists of repetitive elements of various kinds?

a. 0.5

b. 0.05

c. 0.85

d. 0.25

Correct Answer: a

21.What percentage of the human genome is devoted to the protein-coding regions?

a. 1-5%

b. 5-10%

c. 10-20%

d. 20-30%

Correct Answer: a

22.Bioinformatics is also being regarded as part of the

a. Molecular biology

b. Computational biology

c. Computational biotechnology

d. Computer Knowledge

Correct Answer: b

23.The best estimate for the number of human genes is

a. 100000

b. 1000000

c. 50000

d. 30000

Correct Answer: a

24.The Protein Data Bank (PDB)

a. Contains approximately as many structures as there are

protein sequences in SwissProt/TrEMBL

b. Is operated jointly by the NCBI and EBI

c. Includes data on proteins, DNA–protein complexes as well

as carbohydrates

d. Functions primarily as the major worldwide repository of

macromolecular secondary structures

Correct Answer: c

25.Analyzing or comparing entire genome of species is called

a. Pharmacogenomics

b. Genomics

c. Bioinformatics

d. Proteomics

Correct Answer: b

26.Which of the following is untrue about SCOP?

a. It is a database for comparing and classifying

protein structures

b. It is constructed almost entirely based on manual examination

of protein structures

c. The proteins are grouped into hierarchies of classes,

folds, superfamilies, and families

d. The SCOP families consist of proteins having low sequence identity

Correct Answer: d

27.SCOP is _______ based on manual comparison of structures by human experts with no quantitative criteria to group proteins

a. Almost entirely

b. Not

c. Partially

d. never

Correct Answer: a

28.In CATH, Structural domain separation is carried by

a. a combined effort of a human expert and compute r programs

b. Manual comparison only

c. computer programs only

d. human expertise only

Correct Answer: a

29.The term “whole-genome shotgun sequencing” refers to

a. A strategy to sequence an entire genome by breaking up

DNA, cloning it into libraries, sequencing small fragments,

then reassembling the fragments into a complete map

b. A strategy to sequence an entire genome by breaking up

DNA, cloning it into libraries, hybridizing small fragments,

then reassembling the fragments into a complete map

c. A strategy to sequence an entire genome by breaking up DNA and sequencing using oligonucleotide primers that span the genomic DNA

d. ktups, substitution matrix

Correct Answer: a

30.The initiation of FASTA format has ____ symbol

a. >

b. <

c. /

d. *

Correct Answer: a

31.______ the smallest amino acid, has a hydrogen atom as the R group

a. Threonine

b. Glycine

c. Valine

d. Proline

Correct Answer: b

32.Step wise method for solving problems in computer science is called

a. Algorithm

b. Flowchart

c. Sequential design

d. Procedure

Correct Answer: a

33.PRINTS are software used for

a. Detection of gene from genome sequence

b. dentification of functional domain/motifs of protein

c. Prediction of function of a new gene

d. Detecton of tRNAs gene

Correct Answer: b

34.Human genome contains about

a. 3 billion base pairs

b. 2 billion base pairs

c. 5 billion base pairs

d. 4 billion base pairs

Correct Answer: a

35.Which of the following is a protein structure database

a. SwissProt

b. Genbank

c. PDB

d. DDBJ

Correct Answer: c

36.The identification of drugs through genomic study is called

a. Genomics

b. Biochemistry

c. Pharmacogenomics

d. Bioinformatics

Correct Answer: c

37.SWISS PROT is related to

a. Sequence data

b. portable data

c. swiss bank

d. Sequence data bank

Correct Answer: d

38.One centiMorgan is defined as ____ percentage of the total recombination events

a. 0.01

b. 0.1

c. One

d. Ten

Correct Answer: c

39.In general, if you compare the size of a pattern (also called a motif or fingerprint) and a domain

a. They are about the same size

b. The pattern is larger

c. The comparison always depends on the particular proteins

in question

d. The pattern is smaller

Correct Answer: d

40.In comparing two homologous but distantly related proteins

a. They tend to share more percent amino acid identity in

common than three-dimensional structure features

b. They tend to share three-dimensional structure features and percent amino acid identity to a comparable extent

c. It is not reasonable to generalize about the extent to which

they share three-dimensional structure features and percent

amino acid identity

d. They tend to share more three-dimensional structure features

in common than percent amino acid identity

Correct Answer: d

41.The CATH database offers a hierarchical classification of protein structures. The first three levels, class (C), architecture (A), and topology (T), all describe

a. Protein secondary structure (e.g., secondary structure composition, packing, shape, orientation, and connectivity)

b. Protein domain structure

c. Protein superfamilies grouped according to homologous

Domains

d. Protein tertiary structure (e.g., tertiary structure composition,

packing, shape, orientation, and connectivity)

Correct Answer: d

42.What is the source of protein structures in SCOP and CATH?

a. UniPROT

b. Ensembl

c. InterPro

d. Protein Data Bank

Correct Answer: d

43.In an α helix, the R groups on the amino acid residues

a. stack within the interior of the helix

b. are found on the outside of the helix spiral

c. alternate between the outside and the inside of the helix

d. cause only right-handed helices to form

Correct Answer: b

44.The process of introduction of foreign DNA into an animal cell is called

a. Transversion

b. Conversion

c. Inversion

d. Transfection

Correct Answer: d

45.A _______ is defined in SCOP as a collection of superfamilies

a. Primary structure

b. Secondary structure

c. mutated protein

d. Protein Fold

Correct Answer: d

46.Which of the following is a sequence alignment tool?

a. PIR

b. BLAST

c. PRINTS

d. PROSITE

Correct Answer: b

47.The stages of mRNA processing include all of the following except

a. Splicing

b. Export

c. Methylation

d. Polyadenylation

Correct Answer: c

48.Genbank is maintained by

a. SIB

b. NIG

c. NCBI

d. EBI

Correct Answer: c

49.which alignment is useful to detect for highly conserved regions?

a. Multiple

b. Pairwise

c. Local

d. Global

Correct Answer: c

50.when you are comparing two or more than two sequences of same or different organisms,What is the type of alignment?

a. Global

b. Pairwise

c. Multiple

d. Local

Correct Answer: c