Jeongkyu
Lee Home Research Publication Teaching CV Personal |
Project artifacts - Database Administrator From 1996 until 2002, I served as a database administrator at several companies including two banks in Korea as a junior level for five years, and IBM e-Business Hosting team (IDC) as a senior level for two years. During that time, I was responsible for all aspects of database administration: design, implementation, and maintenance of the database infrastructure for the banking systems and hosted servers. After seven-years of industry experience, I decided to join the University of Texas at Arlington to pursue a doctorate degree in computer science, particularly in the areas of databases and multimedia. The reason I moved from industry to academia was that I needed to balance of practical skills in DBMS and strong academic background in databases. After four years of research experience in the area of databases, I am about to return to the proper position where both my industrial and academic backgrounds are needed In this statement, to demonstrate my qualifications and abilities I introduce my belief of a database administrator, and artifacts of database projects I completed successfully as a database administrator. My belief of a Database Administrator
Throughout my life in the industry, I established my belief of a database administrator that I keep in mind wherever I work. My belief is built around five items: 1) user oriented management, 2) harmony of practicality and theory, 3) broad knowledge of IT and computer science, 4) self-motivation and implementation, and 5) documentation.
Artifact
A:
Roles of Database Administrator
Artifact B: Management of Documents Artifact C: TPI (Total Performance Index) for DBMS tuning Artifact D: SDM (Server and Database Monitoring) ToolArtifact
A
Roles of Database Administrator In 2001, I joined IBM e-Business Hosting team as a senior database administrator. IBM e-Business Hosting offers a unique hosting environment for IT infrastructure, called Universal Server Farm (USF), which includes facility, network, server, application, and DBMS. It also covers detailed monitoring, capacity planning and data modeling services. However, the scope of database administrator’s job was not defined clearly, which caused some confusion of responsibilities in the team. As soon as I joined the team, the first job that I completed was to define the roles and scopes of a database administrator. Throughout working at IBM, I performed database administrative jobs based on this document. The roles of a database administrator are defined based on the following criteria:
Main Categories of DBA Roles
First, I determined seven main categories of DBA
roles based on the above criteria as followings:
List of DBA Job for each CategoryNext, I listed up all the detailed jobs in each main category. The information of detailed job contain title, level, period, time and remarks. Each job is classified into three levels, i.e., Basic, Intermediate and Advanced, based on the importance and workload. The period means how often the job is performed, such as daily (D), weekly (W), quarterly (Q), and Yearly/Temporary (Y). The time indicates workloads of the job. The level and time can be used for the evaluation of DBA. Table 1 and 2 are the sample lists of DBA jobs for a monitoring service, and a backup and recovery service, respectively. Table 1. DBA Jobs list for Monitoring Service
Table 2. DBA Jobs list for Backup and Recovery Service
The
full description of roles of database administrator is available on the
following web site:
Artifact B
Management of Documents
From 1996 until 1999, I worked as a junior database
administrator in Information and Technology department at a bank in Korea.
On the first day of attending my office, my manger gave me a simple
question: “Do you think what is the most important job as an
administrator?” Then, his
answer was very simple, too. It
was “Documentation”. Throughout
my seven years experience of a database administrator, I believe that 51%
of DBA jobs are for documenting, while 49% of them are for the other.
The final output of DBA activities is neither recovered systems nor
enhanced SQL statements, but a document where an administrator logged
entire history and processes about how to recover a damaged system and how
to tune up SQL statements. After I became a database administrator, I have created
various documents about database managements, and maintained them
periodically. These documents
always help me to manage and diagnose systems and databases properly.
In addition, these histories provide a clue and cause of system
problems when they occur. List of the
Managed Documents
I introduce 4 selected documents that I created and
managed as a database administrator.
The entire documents are available on my personal DBA web site:
http://www.bridgeport.edu/~jelee/dba/dba.html. 1.
System Configuration for IT Infrastructure
Figure 1. Sample slides of System Configuration for IT Infrastructure 2.
Current State of IT assets
Table 3.
Sample of Current State of IT assets
3.
DBMS Configuration
Table 4. Sample of DBMS Configuration
4.
Analysis of Table Access
Table 5. Sample of Analysis of Table Access
5.
Other Documents
(The above documents are available upon request. Please, email to jelee@bridgeport.edu )
Artifact
C TPI (Total Performance
Index) for DBMS tuning
Database
users complain a performance of system like “Database is very slow. My
program is not running.” They
do not say that “My program cannot access database memory. Please,
increase the size of memory.” Thus,
DBA should be ready to answer any
requests by tuning it up. However,
database tuning is not easy task for an administrator since it is
necessary to check all the aspects of the system including CPU, memory,
disk I/O, and operating system. In
order to keep DBMS best state, I developed a diagnostic tool for DBMS,
called Total Performance Index (TPI) using Microsoft Excel and shell
scripts. TPI checks four categories to evaluate the DBMS, such as memory, disk, educational level of DBA, and system. The detailed items of each category are evaluated by several shell scripts. A perfect score of each category is 250, and the final score of TPI is automatically computed on a maximum scale of 1,000 points. Based on the final score, the DBMS is graded in A to F levels; A is the top 10 percent of most DBMS, and F indicates the bottom 30 percent of most DBMS. In addition, an administrator can use the diagnostic tool to find the weak points to be solved.
Memory Performance Index (MPI) MPI indicates the status of memory utilization in a
database system. The overall
performance of DBMS totally depends on its memory utilization.
In general, it is expected that well-managed DBMS allow a query to
access over 98 percent of data from memory.
If the percentage is below 90, it should be tuned up for better
performance. The MPI score is
computed by considering 9 items. A
shell script automatically investigates the information of each item.
Table 6 shows a MPI score of a sample DBMS.
It indicates that the overall memory performance is very poor that
is caused by some harmful SQL. Table 6. Memory Performance Index (MPI) for a sample DBMS
Disk Performance Index (DPI)
DPI shows a disk performance accessed by a database
system. To keep the best
state of disk performance, there are several rules to be considered when
an administrator configures it; separation of table and index spaces,
partition of table, and distribution of redo/undo/temporary data.
The DPI score is a sum of scores from 9 items. In Table 7, we can realize that there are many fragmented
tables that should be reorganized for better performance.
Table 7. Disk Performance Index (DPI) for a sample DBMS
Education Performance Index (EPI)
The items of EPI are not related to system
performance directly. However,
a state of DBMS may be various depending on a person who manages it; how
often DBA tune it or how much DBA is trained.
The EPI score consists of 10 different items based on educational
levels of DBA and developers. Table
8 shows that some database developers caused the problems of DBMS.
Table 8. Education Performance Index (EPI) for a sample DBMS
System Performance Index (SPI)
The last category of TPI is a system performance
index (SPI). It includes
backup and recovery test, external database review, data modeling, and
parallel processing. Table 9
has 9 items to compute a score of SPI. Table 9. System Performance Index (SPI) for a sample DBMS
Finally, we can compute a score of TPI by a sum of four categories. The sample DBMS mentioned in the above has 570 TPI points out of 1,000. In addition, we grade the overall performance as well as each category. As shown in Table 10, the sample DBMS is C, which means that the DBMS is placed on the top 40 percent of most systems in terms of the total performance. Table 10. Total Performance Index (TPI) for a sample DBMS
The entire TPI tool is available on my personal DBA web site. http://www.bridgeport.edu/~jelee/dba/TPI.xls.
Artifact D
SDM (Server and Database Monitoring) Tool As a senior database administrator at IBM e-Business
hosting team, I needed a new tool to watch customer’s servers and DBMS.
Since the existing monitoring tools, such as Oracle OEM, IBM Tivoli
Monitoring, and BMC Patrol, are not very flexible with various IT
infrastructures, and limited in customization of their functions.
Therefore, I developed Server and Database Monitoring (SDM) tool
that supports any kinds of server and DBMS, and is very flexible in the
Internet environment. In
addition to monitoring service, SDM needs to detect and alarm abnormal
states of the systems, and generate periodic reports automatically.
What is SDM?
SDM is a web-based monitoring and reporting tool for
servers and databases in real-time. SDM
was developed by using ksh, awk and Perl5 for client side to collect the
meta-data, and gd library, zlib, rrdtool v1.0.28 and Perl5 for server side
to analyze them. Figure 2
shows the overall SDM architecture. As
shown in the figure, the meta-data are gathered from each server and DBMS
using a shell script. The
data are transferred into SDM server via FTP for the analysis.
Then, the output of analyzed data is served as two modes.
One is for an operator watching abnormal states that are not
identified by SDM server, and to generate regular reports.
The other is for a customer to check his/her own systems.
Both of them are served through web servers.
Figure 2. Architecture of Server and Database Monitoring (SDM) toolFeature 1: Server Monitoring
Feature 2: Database Monitoring
Figure 3. Screenshots of Database Monitoring ServiceFeature 3: Server Resource Reporting
Figure 4. Screenshots of Server Resource Reporting ServiceFeature 4: Problem Reporting
A SDM document and software are available on my personal DBA web site. Document: http://www.bridgeport.edu/~jelee/dba/SDM.ppt Software: http://www.bridgeport.edu/~jelee/dba/SDM.zipLast updated: 8/10/2006 |