
INTERNSHIP PROJECT REPORT (Project Semester July- December 2011)

OPTIMUM SEARCH ENGINE


Submitted by

ANJANI PRASAD KUNWAR Registration No:10807973 K18E6A02 B.TECH(CSE)-H

Under the Guidance of MISS ANJLEE VERMA Department of Computer Science Lovely Faculty Of Technology And Science Lovely Professional University, Phagwara

DECLARATION

I, ANJANI PRASAD KUNWAR, student of B.TECH (HONS) CSE, under the Department of Lovely Faculty of Technology and Sciences of Lovely Professional University, Punjab, hereby declare that all the information furnished in this dissertation / capstone project report is based on my own intensive research and is genuine. This dissertation / report does not, to the best of my knowledge, contain any part of my work which has been submitted for the award of a degree either at this university or at any other university without proper citation.

ANJANI PRASAD KUNWAR 10807973 DATE: 26/11/2011

CERTIFICATE

This is to certify that ANJANI PRASAD KUNWAR bearing Registration no. 10807973 has partially completed capstone project titled, OPTIMUM SEARCH ENGINE under my guidance and supervision. To the best of my knowledge, the present work is the result of his original investigation and study. No part of the work has ever been submitted for any other degree at any University.

Name of the Supervisor: Miss Anjlee Verma Lovely Faculty of Technology And Sciences. Lovely Professional University Phagwara, Punjab. Date: 26/11/2011

CONTENT

1. Organization Overview
1.1 SARS Softavate Company
1.2 Services
1.2.1 Details of Outsourcing Services
2. Profile of the Problem
2.1 Present State of Art
2.2 Need of the System
2.3 Problem Definition
3. Existing System
3.1 Introduction
3.2 Existing Software
3.3 DFD for Present System
3.4 Features of System to be Developed
4. Problem Analysis
4.1 Product Definition
4.2 Feasibility Analysis
4.3 Project Plan
5. Software Requirement Analysis
5.1 Introduction
5.2 General Description
5.3 Specific Requirements
6. Design
6.1 System Design
6.2 Design Notations, Detailed Design, Flowcharts
6.3 Pseudo Code
7. Conclusion & References
7.1 Conclusion
7.2 Future Scope of Project
7.3 Bibliography

ACKNOWLEDGEMENT

Apart from my own efforts, the success of any project depends largely on the encouragement and guidance of many others. I take this opportunity to express my gratitude to the people who have been instrumental in the successful completion of this project.

I would like to express my greatest appreciation to Miss Anjlee Verma. I cannot thank her enough for her tremendous support and help. I feel motivated and encouraged every time I attend her meetings. Without her encouragement and guidance this project would not have materialized.

The guidance and support received from all the members who contributed and who are contributing to this project, was vital for the success of the project. I am grateful for their constant support and help.

Organization Overview
1.1 SARS Softavate Company

SARS Softavate Technologies Pvt Ltd is a software development company providing a full range of customized software solutions for Information Technology and related areas. It is a global services provider delivering technology-driven business solutions that meet the strategic objectives of its clients and that are built around the specific needs of research organizations and industries. It delivers business value to customers through a combination of process excellence, quality frameworks and service delivery innovation.

SARS Softavate Technologies Pvt Ltd has delivered hundreds of cost-effective, high-quality software solutions for a wide range of industries and domains. These solutions have included consumer and business software development, web hosting, retail manufacturing, real estate, community services and many others.

Custom Software Solutions

SARS Softavate Technologies Pvt Ltd does its best to provide you with custom software solutions that will help you keep one step ahead of your competitors by continuously improving your IT-based business solutions. We use the latest software development platforms and tools to achieve this. We also employ the latest project management techniques and software engineering practices. We offer a wide range of custom web and software programming services in a variety of application areas, including:

Custom eCommerce Solutions (B2B and B2C)
Database Business Applications
Distributed (client-server) applications
Internet/Intranet Applications
Websites and web-tools

Our solutions are all about automation: a global-standard look and feel, user-friendliness, admin-defined permissions, an effective dashboard, chart reports, easily customizable reports and, last but not least, a cost-effective software solution with auto alert / auto reminder functionality. They keep you up to date so that you need not worry about managing your Accounts, HR, IT, Management and Admin, they cater to all your IT needs, and they are specially designed to suit your company's working environment. We at SARS Softavate Technologies Pvt. Ltd have decades of experience in the field of software marketing and software development to meet the specific requirements of the client. We also provide IT services, which include:

a) Customized Software Solution

We specialize in Internet / Intranet based custom software solutions in various technologies, i.e. Java, J2EE, Struts, Spring, GWT, Hibernate, .NET, PHP etc. We also provide consultancy services for software development, evaluating the IT needs of your company/organization in order to get you the best results. With decades of experience in software marketing, our market experts have answers and solutions for all your doubts and queries. In the last 5 years, we have successfully delivered hundreds of projects, ranging from School Information Management System, Invoice Management Software, Inventory Management, CRM, Multi Level Marketing Software, Lead Management Software and AMC Management Software to Recruitment System and so on.

b) Product Development

After continuous evaluation of the needs of IT / software automation in the current scenario, and after interacting over the last decades with hundreds of clients and their common and unique requirements, we developed a unique concept to meet the need for INVENTORY MANAGEMENT, INVOICING SYSTEM, HUMAN RESOURCE MANAGEMENT, RECRUITMENT MANAGEMENT SYSTEM, CUSTOMER RELATIONSHIP MANAGEMENT SYSTEM and SCHOOL INFORMATION MANAGEMENT SYSTEM, with user-defined fields and functionality.

c) Outsourcing

There are several activities in a business that are not integral to it, and these processes can be successfully managed through outsourcing. To cut your costs and give you better services, so that you can concentrate on your core business activities, we at SARS SOFTAVATE TECHNOLOGIES PVT LTD., with a proficient team that has more than 5 years of experience in HR/Payroll Processing, Consultancy Services, Data Management and Event Management, provide you with onshore and offshore outsourcing facilities.

1.2 SERVICES

Software Services

Custom Software Services
Software Analysis
Software Development
Software Testing
Database Designing

Web Services

e-Business Solutions
Website Design
Web Development
SEO Services
Logo Design
Domain Hosting

1.2.1 Details of Outsourcing Services

a) PAYROLL OUTSOURCING

One of the integral parts of HR is payroll. Timely and accurate calculation of salary and wages, coupled with statutory compliance, is automatically expected. The task of payroll has become complicated in today's industries due to multiple statutory requirements, MIS requirements, calculations based on slabs and formulas, and other HR policies. Our Payroll Services take care of all your requirements relating to the management of employees' payroll. They provide complete records of employees, pay slips and attendance registers, computation of all allowances and deductions, and generation of all statutory and MIS reports.

KEY FEATURES OF PAYROLL OUTSOURCING

i. Our services can be customized to user requirements.
ii. All Statutory Deductions such as PF / ESI / Bonus / Gratuity / TDS / PT are taken care of. If the payroll of more than one location is managed, pay slips can be defined location-wise.
iii. Employees can be referred to either by Employee ID or by Reference No.
iv. No. of days in the month can be flexibly defined.
v. Working days can be defined for each branch.
vi. No. of days present and No. of days on which salary is calculated (including Paid Leave) are both considered when formulating a field value.
vii. Rounding off options and carrying over of rounding off values to the next month.
viii. Leave Management: Tight Integration / Loose Integration.
ix. OT Rate Calculation: on an hourly basis or on a per-day basis.
x. Multiple Branch Details in a Single File.
xi. PF / ESI / PT rate user definable on a monthly basis.
xii. Arrears Calculation.
xiii. No restriction on Record Management Dates: any number of years of data can be managed.
xiv. User Definable Salary Heads.
xv. Salary Definition on the Basis of Group / Scale.
xvi. Salary components can be defined as Inputs / Calculated Fields / OT.
xvii. Calculation of Salary Fields depends on Formulas / Slab.
xviii. Advance Management.
xix. Leave Details Management with Unlimited No. of Leave Types.
xx. Holiday Definition.
xxi. Increment.
xxii. Detailed Master Data Capture.
xxiii. Company Master.
xxiv. Employee Details.
xxv. Bank / Branch Details: for Bank Statement and for Standing Instruction.
xxvi. Basic and Essential HR Details.
xxvii. Salary Definition on the Basis of Group / Scale.
xxviii. Leave Details Management with Unlimited No. of Leave Types.
xxix. Salary Components not considered in Regular Salary Sheet.
xxx. Strong TDS Computation Engine.
xxxi. Salary calculation for the purpose of TDS Estimation.
xxxii. Gross Salary Splitting: Tax planning.
xxxiii. Form 16, Form 12 BA & ETDS Generation.
xxxiv. Income Tax Computation.
xxxv. Leave Travel Allowance.
xxxvi. Medical Reimbursement.
xxxvii. Library of Blank Statutory Reports.
xxxviii. Holding Salary of a List of Employees.
xxxix. Locking Months.
xl. Exporting Salary Details to Excel.
xli. Creating a new File from Existing Data.
xlii. Advance Employee Delete option.
xliii. Strong List of general utilities like Back up / Restore / Update.
xliv. Standing Instructions: Loan / SSS / Insurance / Group Insurance.
xlv. Pension.

b) SOFTWARE PRODUCT OUTSOURCING

Successful software product development companies are those that react quickly to constantly changing market conditions. They need to deliver outsourced products of the highest quality at the lowest price that satisfies customer needs. Thanks to our business leaders and their years of research and evaluation of software products, we are now able to deliver the best software solution as per your needs. We also provide consulting services for software products and share our Product Analysis Report to help you choose the best.

Profile of the Problem


2.1 Present State of Art

In the existing system we found that the entire work of the NGO, i.e. the management of information regarding establishment, operations, accounts and day-to-day working details, was held in different file formats. Searching for something in that huge amount of data was very difficult and time consuming. Users searching over such large amounts of data face many problems, which are the limitations of the existing system. These include:

Manual search.
Reading each and every file very carefully.
Maintaining a catalogue of prime and important information.
Very time consuming, which results in wasted manpower.

2.2 Need of the System

As a result, the company sometimes has to face very slow output from its operations. Many new features are therefore added, and the Optimum Search Engine System is being developed to overcome all these limitations.

2.3 Problem Definition

The problem definition is to identify what information is to be processed, what functions and performance are desired, what interfaces are to be established, what design constraints exist and what validation criteria are to be adopted.

This phase of system development is of great importance for the system analyst, who needs a deep knowledge of the topic being worked on. The analyst should study the existing system that the company already uses and note its limitations and its strengths, as described in the previous chapter. This involves visiting and inspecting the company to study the system, as well as studying the output the existing system provides. It is good practice for the analyst to carry the strengths of the existing system over into the new one.

In the existing system we found that the entire work of the NGO, i.e. the management of information regarding establishment, operations, accounts and day-to-day working details, was held in different file formats. Enquiring about something in that huge amount of data was very difficult and time consuming.

Existing System
3.1 Introduction

A search engine is an information retrieval system designed to help find information stored on a computer system. The search results are usually presented in a list and are commonly called hits. Search engines help to minimize the time required to find information and the amount of information which must be consulted, akin to other techniques for managing information overload.

Keeping in mind the drawbacks of the present system, and to curb them, the Optimum Search Engine System is being developed. It is designed to be far more efficient than the present manual search system, even when as many as one million documents are migrated from the manual system, because its features (described in Section 3.4) overcome the shortcomings of the previous system.

The objective of the project is to optimize the searching process over the system's files.

Along with the front-end application, this project uses MySQL in the back end for database management. The project has been divided into various modules for better understanding.

3.2 Existing Software

Search engines provide an interface to a group of items that enables users to specify criteria about an item of interest and have the engine find the matching items. The criteria are referred to as a search query. In the case of text search engines, the search query is typically expressed as a set of words that identify the desired concept that one or more documents may contain.[1] There are several styles of search query syntax that vary in strictness. It can also switch names within the search engines from previous sites. Whereas some text search engines require users to enter two or three words separated by white space, other search engines may enable users to specify entire documents, pictures, sounds, and various forms of natural language. Some search engines apply improvements to search queries to increase the likelihood of providing a quality set of items through a process known as query expansion.
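As an illustration of the query handling described above, the following is a minimal sketch of splitting a query into words and applying query expansion. It is not taken from the project; the class name, the method name and the synonym table are hypothetical, and real engines use far richer expansion rules.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: query expansion with a hand-made synonym table.
class QueryExpansionSketch {

    // Splits the query on white space and adds any known synonyms of each word.
    static Set<String> expand(String query, Map<String, List<String>> synonyms) {
        Set<String> terms = new LinkedHashSet<>();
        for (String word : query.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            if (word.isEmpty()) {
                continue; // an empty query yields no terms
            }
            terms.add(word);
            terms.addAll(synonyms.getOrDefault(word, Collections.emptyList()));
        }
        return terms;
    }

    public static void main(String[] args) {
        // Assumed synonym table, for illustration only.
        Map<String, List<String>> synonyms = new HashMap<>();
        synonyms.put("salary", Arrays.asList("payroll", "wages"));

        System.out.println(expand("Salary report", synonyms));
        // prints [salary, payroll, wages, report]
    }
}
```

The expanded set of terms would then be matched against the documents instead of the original two words, which is how expansion increases the chance of finding relevant items.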

To provide a set of matching items that are sorted according to some criteria quickly, a search engine will typically collect metadata about the group of items under consideration beforehand through a process referred to as indexing. The index typically requires a smaller amount of computer storage, which is why some search engines only store the indexed information and not the full content of each item, and instead provide a method of navigating to the items in the search engine result page. Alternatively, the search engine may store a copy of each item in a cache so that users can see the state of the item at the time it was indexed or for archive purposes or to make repetitive processes work more efficiently and quickly. Other types of search engines do not store an index. Crawler, or spider type search engines (a.k.a. real-time search engines) may collect and assess items at the time of the search query, dynamically considering additional items based on the contents of a starting item (known as a seed, or seed URL in the case of an Internet crawler). Meta search engines store neither an index nor a cache and instead simply reuse the index or results of one or more other search engines to provide an aggregated, final set of results.
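The indexing step described above can be pictured with a small in-memory inverted index. This is only a sketch under simple assumptions (documents are plain strings identified by integer ids, and words are split on non-alphanumeric characters); the class and method names are hypothetical and do not come from the project code.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch: a tiny in-memory inverted index.
class InvertedIndexSketch {

    // term -> sorted ids of the documents that contain the term
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    // Indexing: split the text into lower-case words and record the document id for each word.
    void addDocument(int docId, String text) {
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                postings.computeIfAbsent(token, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    // Querying: a single map lookup instead of re-reading every document.
    Set<Integer> lookup(String term) {
        Set<Integer> ids = postings.get(term.toLowerCase(Locale.ROOT));
        return ids == null ? Collections.emptySet() : Collections.unmodifiableSet(ids);
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.addDocument(1, "Annual accounts report");
        index.addDocument(2, "Daily operations report");

        System.out.println(index.lookup("report"));   // prints [1, 2]
        System.out.println(index.lookup("accounts")); // prints [1]
    }
}
```

The index stores only words and document ids, not the documents themselves, which is why it typically needs less storage than the full content.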

3.3 DFD for Present System

[Figure: A simple interface representation]

[Figure: A proposed system DFD]

3.4 FEATURES OF NEW SYSTEM TO BE DEVELOPED

FEATURES:

Centralized Data Storage.
Multiple User Access.
Data Security.
Optimized Search over the System.

Centralized Data Storage: The Optimum Search Engine System is a completely web-based system in which all data is stored on a centralized web server.

Multiple User Access: In this system the files are stored on a web server that supports a multiuser architecture, so thousands of users can use the system simultaneously.

Data Security: All data is stored on a centralized web server on which only authenticated users can use the functionality of the system. This guards against unauthorized access to the data.

Optimum and Efficient Data Search: The proposed system returns the output of a search very quickly, which saves an enormous amount of time. The system also provides optimized functionality for the same process, such as AND and OR clauses, which makes searching more efficient.
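The AND and OR clauses mentioned above correspond to set intersection and set union over the documents matched by each search word. The sketch below assumes that each word has already been resolved to a set of document ids; the class and method names are hypothetical and not taken from the project.

```java
import java.util.Arrays;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch: combining per-word results with AND / OR clauses.
class BooleanQuerySketch {

    // AND: keep only documents that appear in both result sets (intersection).
    static Set<Integer> and(Set<Integer> left, Set<Integer> right) {
        Set<Integer> result = new TreeSet<>(left);
        result.retainAll(right);
        return result;
    }

    // OR: keep documents that appear in either result set (union).
    static Set<Integer> or(Set<Integer> left, Set<Integer> right) {
        Set<Integer> result = new TreeSet<>(left);
        result.addAll(right);
        return result;
    }

    public static void main(String[] args) {
        // Assumed example data: ids of documents matching each word.
        Set<Integer> accounts = new TreeSet<>(Arrays.asList(1, 3));
        Set<Integer> report = new TreeSet<>(Arrays.asList(2, 3));

        System.out.println(and(accounts, report)); // prints [3]
        System.out.println(or(accounts, report));  // prints [1, 2, 3]
    }
}
```

AND narrows a result list and OR widens it, so a user can refine a query step by step without re-reading any file.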

Problem Analysis
4.1 PRODUCT DEFINITION

Search Engine is a standalone system. It provides modules for crawling, indexing, sorting and searching files and web pages in Applet format.

1. Search engine: A search engine is an information retrieval system designed to help find information stored on a computer system, such as on the World Wide Web, inside a corporate or proprietary network, or on a personal computer.
2. Crawler: A web crawler (also known as a Web spider or Web robot) is a program or automated script which browses the World Wide Web in a methodical, automated manner. Other less frequently used names for Web crawlers are ants, automatic indexers, bots, and worms.
3. Indexing: Search engine indexing covers how data is collected, parsed, and stored to facilitate fast and accurate retrieval.
4. Web directories: A web directory is a directory on the World Wide Web. It specializes in linking to other web sites and categorizing those links. A web directory is not a search engine and does not display lists of web pages based on keywords; instead, it lists web sites by category and subcategory. The categorization is usually based on the whole web site, rather than one page or a set of keywords, and sites are often limited to inclusion in only one or two categories. Web directories often allow site owners to directly submit their site for inclusion, and have editors review submissions for fitness.
5. URL normalization: URL normalization (or URL canonicalization) is the process by which URLs are modified and standardized in a consistent manner. The goal of the normalization process is to transform a URL into a normalized or canonical URL so that it is possible to determine whether two syntactically different URLs are equivalent.
6. Lexicon: A lexicon can be a list of words together with additional word-specific information, i.e., a dictionary.

4.2 FEASIBILITY ANALYSIS

Depending on the results of the initial investigation, the survey is expanded to a more detailed feasibility study. Feasibility study is a test of a system proposal according to its workability, impact on the organization, ability to meet user needs, and effective use of resources. The objective of a feasibility study is not to solve the problem but to acquire a sense of its scope. During the study, the problem definition is crystallized and aspects of the problem to be included in the system are determined. Consequently, costs and benefits are estimated with greater accuracy at this stage.

The result of the feasibility study is the formal proposal. This is simply a report, a formal document detailing the nature and scope of the proposed solution. The proposal summarizes what is known and what is going to be done. It consists of the following:

Statement of the problem.
Summary of findings and recommendations.
Details of findings.
Recommendations and conclusions.

Once it has been determined that a project is feasible, the analyst can go ahead and prepare the project specification which finalizes project requirements. Generally Feasibility studies are undertaken within tight time constraints and normally culminate in a written and oral feasibility report. The contents and recommendations of such a study will be used as a sound basis for deciding whether to proceed, postpone or cancel the project. Thus, since the feasibility study may lead to the commitment of large resources, it becomes necessary that it should be conducted competently and that no fundamental errors of judgment are made.

The key considerations in feasibility analysis are:

1. Economic Feasibility
2. Technical Feasibility
3. Behavioral Feasibility

4.2.1 Economic Feasibility:

Economic analysis is the most frequently used method for evaluating the effectiveness of the candidate system. More commonly known as cost/benefit analysis, the procedure is to determine the benefits and savings that are expected from a candidate system and compare them with its costs. If the benefits outweigh the costs, then the decision is made to design and implement the system.

A system's financial benefit must exceed the cost of developing that system, i.e. a new system being developed should be a good investment for the organization. Economic feasibility considers the following:

i. The cost to conduct a full system investigation.
ii. The cost of hardware and software for the class of application.
iii. The benefits in the form of reduced cost or fewer costly errors.
iv. The cost if nothing changes (i.e. the proposed system is not developed).
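The cost/benefit comparison above reduces to simple arithmetic. The sketch below only illustrates the rule "benefits must outweigh costs"; the class name is hypothetical and every figure in it is a made-up placeholder, not data from this project.

```java
// Hypothetical sketch of the cost/benefit rule; all amounts are placeholder values.
class CostBenefitSketch {

    // Net benefit = expected benefits minus the investigation and hardware/software costs.
    static long netBenefit(long expectedBenefits, long investigationCost, long hardwareSoftwareCost) {
        return expectedBenefits - (investigationCost + hardwareSoftwareCost);
    }

    // A candidate system is considered economically feasible only if its net benefit is positive.
    static boolean isEconomicallyFeasible(long expectedBenefits, long investigationCost,
                                          long hardwareSoftwareCost) {
        return netBenefit(expectedBenefits, investigationCost, hardwareSoftwareCost) > 0;
    }

    public static void main(String[] args) {
        long benefits = 500_000;       // placeholder: savings from reduced cost and fewer errors
        long investigation = 50_000;   // placeholder: cost of the full system investigation
        long hardwareSoftware = 200_000; // placeholder: cost of hardware and software

        System.out.println(netBenefit(benefits, investigation, hardwareSoftware));             // prints 250000
        System.out.println(isEconomicallyFeasible(benefits, investigation, hardwareSoftware)); // prints true
    }
}
```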

The proposed Optimum Search Engine System is economically feasible because:

i. The system requires very little time to operate.
ii. The system will provide a fast and efficient automated environment instead of a slow and error-prone manual system, thus reducing both the time and the manpower spent in running the system.
iii. The system will have a GUI interface, and very little user training is required to learn it.
iv. The system will provide a service to view various information for proper managerial decision making.

4.2.2 Technical Feasibility:

Technical feasibility centers on the existing computer system (hardware, software etc.) and the extent to which it can support the proposed addition. For example, if the current computer is operating at 80 percent capacity (an arbitrary ceiling), then running another application could overload the system or require additional hardware. This involves financial considerations to accommodate technical enhancements. If the budget is a serious constraint, then the project is judged not feasible. In this project, all the necessary precautions have been taken to make it technically feasible. Using a key, the display of text/objects is very fast.

4.2.3 Behavioral Feasibility:

People are inherently resistant to change, and computers have been known to facilitate change. An estimate should be made of how strong a reaction the user staff is likely to have toward the development of a computerized system. It is therefore understandable that the introduction of a candidate system requires special efforts to educate and train the staff. The software being developed is user friendly and easy to learn, so it is expected to be usable across different circumstances, working traditions and locales.

4.3 PROJECT PLAN

Project Plan involves the following steps:

Feasibility Study
Requirement Analysis
Requirement Specification

FEASIBILITY STUDY

A feasibility study is used to determine whether it is cost effective to go ahead with the system and whether it is actually possible. The Optimum Search Engine System is cost effective to go ahead with and is feasible in terms of technology, schedule, operation and legality, as stated in the preceding Feasibility Analysis section.

Software Requirement Analysis


5.1 Introduction

This phase involves identifying the requirements and needs of the system and modeling these needs in terms of the processes carried out. The Optimum Search Engine System has been developed to fulfill the requirements stated by the client. The project will make the process of searching effective and optimized. In order to implement the above requirements, the following steps need to be followed:

Step 1: Upload all files of any kind into centralized web server.

Step 2: Delete any unnecessary files from the centralized web server.

Step 3: Apply Search criteria for the required query.

Step 4: Find the solution from the output.

Step 5: Improve your search criteria by applying provided clauses.
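The five steps above can be sketched as a small in-memory stand-in for the centralized store. This is an illustration only, under the assumptions that files are plain text and that a search matches substrings regardless of case; the class, method and file names are hypothetical, and the real system keeps its files on a web server.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch of the upload / delete / search / refine workflow.
class SearchWorkflowSketch {

    // file name -> file content, kept in upload order
    private final Map<String, String> files = new LinkedHashMap<>();

    // Step 1: upload a file into the store.
    void upload(String name, String content) {
        files.put(name, content);
    }

    // Step 2: delete an unnecessary file; returns false if it was not present.
    boolean delete(String name) {
        return files.remove(name) != null;
    }

    // Steps 3 to 5: return the names of files containing every given word.
    // Adding more words narrows the result, which is how a search is refined.
    List<String> search(String... requiredWords) {
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> file : files.entrySet()) {
            String text = file.getValue().toLowerCase(Locale.ROOT);
            boolean matchesAll = true;
            for (String word : requiredWords) {
                if (!text.contains(word.toLowerCase(Locale.ROOT))) {
                    matchesAll = false;
                    break;
                }
            }
            if (matchesAll) {
                hits.add(file.getKey());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        SearchWorkflowSketch store = new SearchWorkflowSketch();
        store.upload("accounts-2010.txt", "Annual accounts and audit report");
        store.upload("ops-log.txt", "Day to day operations report");
        store.upload("scratch.tmp", "temporary notes");
        store.delete("scratch.tmp");

        System.out.println(store.search("report"));          // prints [accounts-2010.txt, ops-log.txt]
        System.out.println(store.search("report", "audit")); // prints [accounts-2010.txt]
    }
}
```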

5.2 General Description

MODULES

Admin: View Information, Upload Files, Delete Files, Free Text Search, Data Search

User: Free Text Search, Data Search

Help: User Manual, FAQ
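The split of functions between the Admin and User modules can be expressed as a role-to-permission table. The sketch below is one possible way to encode the list above; the enum and class names are hypothetical, and the Help module is left out on the assumption that it is open to everyone.

```java
import java.util.EnumMap;
import java.util.EnumSet;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: which module functions each role may use.
class ModuleAccessSketch {

    enum Role { ADMIN, USER }

    enum Action { VIEW_INFORMATION, UPLOAD_FILES, DELETE_FILES, FREE_TEXT_SEARCH, DATA_SEARCH }

    private static final Map<Role, Set<Action>> PERMISSIONS = new EnumMap<>(Role.class);

    static {
        // Admin may use every function listed for the Admin module.
        PERMISSIONS.put(Role.ADMIN, EnumSet.allOf(Action.class));
        // User is limited to the two search functions.
        PERMISSIONS.put(Role.USER, EnumSet.of(Action.FREE_TEXT_SEARCH, Action.DATA_SEARCH));
    }

    // Returns true if the given role is permitted to perform the action.
    static boolean isAllowed(Role role, Action action) {
        return PERMISSIONS.get(role).contains(action);
    }

    public static void main(String[] args) {
        System.out.println(isAllowed(Role.ADMIN, Action.DELETE_FILES));    // prints true
        System.out.println(isAllowed(Role.USER, Action.DELETE_FILES));     // prints false
        System.out.println(isAllowed(Role.USER, Action.FREE_TEXT_SEARCH)); // prints true
    }
}
```

Keeping the permissions in one table means a new role or function can be added without touching the code that checks access.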

DATA FLOW DIAGRAM (DFD)

LEVEL 0:

LEVEL 1:

ENTITY-RELATIONSHIP DIAGRAM

Use Case Diagram

Class Diagram

5.3 Specific Requirements

H/W Requirements

SERVER:
Pentium IV 2.4 GHz or above
1 GB RAM or more
Intel motherboard
2 x 40 GB HDD
104-key standard keyboard

CLIENT:
Pentium IV
Minimum of 256 MB RAM

S/W Requirements

Front End: HTML, Java Server Pages, Servlets, JavaScript, Ext JS, AJAX

Back End: MySQL

HTML

The language used to develop static web pages is called Hyper Text Markup Language (HTML). HTML is the language interpreted by a Browser. Web Pages are also called HTML documents. HTML is a set of special codes that can be embedded in text to add formatting and linking information. HTML is specified as TAGS in an HTML document. Tags are instructions that are embedded directly into the text of document. An HTML tag is a signal to a browser that it should do something other than just throw text up on the screen.

JSP (Java Server Pages)

JSP stands for Java Server Pages. JSP pages are executed on the web server: each page is processed from top to bottom to create an HTML page that is sent to the browser for display. Before discussing JSP further, it is necessary to understand the J2EE architecture, which the figures below explain.

Introduction to JSP

Java Server Pages (JSP) is a technology based on the Java language that enables the development of dynamic web sites. JSP was developed by Sun Microsystems to allow server-side development. JSP files are HTML files with special tags containing Java source code that provide the dynamic content. The following shows a typical web server, with different clients connecting via the Internet to the web server. In this example, the web server is running on Unix and is the very popular Apache web server.

JSP source code runs on the web server in the JSP Servlet Engine. The JSP Servlet Engine dynamically generates the HTML and sends the HTML output to the client's web browser.

At first, only static web pages were displayed. Typically these were people's first experience of making web pages, so they consisted of "My Home Page" sites and company marketing information. Afterwards, Perl and C were used on the web server to provide dynamic content. Soon most languages, including Visual Basic, Delphi, C++ and Java, could be used to write applications that provided dynamic content using data from text files or database requests. These were known as CGI server-side applications. ASP was developed by Microsoft to allow HTML developers to easily provide dynamic content, supported as standard by Microsoft's free web server, Internet Information Server (IIS). JSP is the equivalent from Sun Microsystems; a comparison of ASP and JSP will be presented in the following section.

JSP is easy to learn and allows developers to quickly produce web sites and applications in an open and standard way. JSP is based on Java, an object-oriented language, and offers a robust platform for web development. The main reasons to use JSP are:

Multi-platform.
Component reuse by using JavaBeans and EJB.
Advantages of Java.

JSP architecture

JSPs are built on top of Sun's servlet technology. A JSP is essentially an HTML page with special JSP tags embedded. These JSP tags can contain Java code. The JSP file extension is .jsp rather than .htm or .html. The JSP engine parses the .jsp file and creates a Java servlet source file. It then compiles the source file into a class file. This is done the first time the page is requested, which is why a JSP is usually slower the first time it is accessed. On every later request the already compiled servlet is executed, so the page is returned faster.
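The translate-once, reuse-afterwards behaviour described above can be imitated in plain Java. The sketch below is not a JSP engine and uses no servlet API; it only simulates the idea with a cache, and all names in it (including the page name) are hypothetical.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Supplier;

// Hypothetical simulation of "compile on first request, reuse afterwards".
class JspCacheSketch {

    // page name -> stand-in for the compiled servlet that renders the page
    private final Map<String, Supplier<String>> compiled = new ConcurrentHashMap<>();
    private int translationCount = 0;

    // Stand-in for the expensive step: parse the .jsp, generate servlet source, compile it.
    private Supplier<String> translate(String page) {
        translationCount++;
        return () -> "<html><body>Rendered " + page + "</body></html>";
    }

    // Handles a request: translates the page only if no compiled version exists yet.
    String request(String page) {
        return compiled.computeIfAbsent(page, this::translate).get();
    }

    int translationCount() {
        return translationCount;
    }

    public static void main(String[] args) {
        JspCacheSketch engine = new JspCacheSketch();
        engine.request("search.jsp"); // first request: translated and compiled
        engine.request("search.jsp"); // second request: compiled version reused
        System.out.println(engine.translationCount()); // prints 1
    }
}
```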

JavaScript

The web site development environment should also provide a facility for validating user input. This facility is available in JavaScript. JavaScript is embedded into an HTML program because JavaScript uses the filename.html file and the HTTP protocol to transport itself from the web server to the client's browser, where the JavaScript executes and processes client information.

JavaScript is an object-oriented language that allows the creation of interactive web pages. JavaScript allows user entries, which are loaded into an HTML form, to be processed as required. This empowers a web site to return site information according to a user's requests.

JavaScript was designed to add interactivity to HTML pages.
JavaScript is a scripting language (a scripting language is a lightweight programming language).
A JavaScript consists of lines of executable computer code.
A JavaScript is usually embedded directly into HTML pages.
JavaScript is an interpreted language (meaning that scripts execute without preliminary compilation).
Everyone can use JavaScript without purchasing a license.

JavaScript is a simple to comprehend, easy to use, general purpose scripting language. When used in conjunction with a Web browser's Document Object Model (DOM), it can produce powerful dynamic HTML browser-based applications which can also feature animation and sound.

MYSQL Introduction:

The MySQL (R) software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software. MySQL is a trademark of MySQL AB.

History of MySQL

We started out with the intention of using mSQL to connect to our tables using our own fast low-level routines. However, after some testing, we came to the conclusion that mSQL was not fast enough or flexible enough for our needs. This resulted in a new SQL interface to our database but with almost the same API interface as mSQL. This API was designed to allow third-party code that was written for use with mSQL to be ported easily for use with MySQL. The derivation of the name MySQL is not clear. Our base directory and a large number of our libraries and tools have had the prefix "my" for well over 10 years. However, co-founder Monty Widenius's daughter is also named My. The name of the MySQL Dolphin (our logo) is Sakila. It was chosen by the founders of MySQL AB from a huge list of names suggested by users in our "Name the Dolphin" contest. The winning name was submitted by Ambrose Twebaze, an open source software developer from Swaziland, Africa. According to Ambrose, the name Sakila has its roots in SiSwati, the local language of Swaziland. Sakila is also the name of a town in Arusha, Tanzania, near Ambrose's country of origin, Uganda.

Main Features of MySQL

The following list describes some of the important characteristics of the MySQL Database Software:

Internals and Portability
Column Types
Commands and Functions
Security
Scalability and Limits
Connectivity

How Big MySQL Tables Can Be

MySQL Version 3.22 had a 4 GB (4 gigabyte) limit on table size. With the MyISAM storage engine in MySQL Version 3.23, the maximum table size was increased to 8 million terabytes (2 ^ 63 bytes). With this larger allowed table size, the maximum effective table size for MySQL databases now normally is determined by operating system constraints on file sizes, not by MySQL internal limits.
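The figures quoted above can be checked with a little arithmetic. The sketch below assumes binary units (1 TB = 2^40 bytes, 1 GB = 2^30 bytes), which the text does not state explicitly; the function names are illustrative and not part of MySQL.

```python
# Assumption: binary units, i.e. 1 GB = 2**30 bytes and 1 TB = 2**40 bytes.
BYTES_PER_GB = 2 ** 30
BYTES_PER_TB = 2 ** 40

# Internal MyISAM limit quoted in the text for MySQL 3.23.
MYISAM_LIMIT_BYTES = 2 ** 63


def bytes_to_terabytes(size_in_bytes: int) -> int:
    """Convert a byte count to whole terabytes (binary units)."""
    return size_in_bytes // BYTES_PER_TB


def effective_table_limit(os_file_limit_bytes: int) -> int:
    """The usable table size is the smaller of the MySQL and OS limits."""
    return min(MYISAM_LIMIT_BYTES, os_file_limit_bytes)


print(bytes_to_terabytes(MYISAM_LIMIT_BYTES))  # 8388608, about 8 million TB

# With a 2 GB operating system file-size limit, the OS limit is the binding one.
print(effective_table_limit(2 * BYTES_PER_GB) // BYTES_PER_GB)  # 2
```

Because 2^63 / 2^40 = 2^23 = 8,388,608, the "8 million terabytes" figure is a rounded value, and any realistic operating system file-size limit is far below it.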

The following table lists some examples of operating system file-size limits:

Operating System          File-Size Limit
Linux-Intel 32-bit        2 GB
Linux-Alpha               8 TB
Solaris 2.5.1             2 GB
Solaris 2.6               4 GB
Solaris 2.7 Intel         4 GB
Solaris 2.7 UltraSPARC    512 GB

DESIGN
6.1: System Design

The design methodology used in this work is bottom-up. A bottom-up approach pieces systems together to give rise to larger systems, making the original systems sub-systems of the emergent system. The individual base elements of the system are first specified in great detail. These elements are then linked together to form larger subsystems, which in turn are linked, sometimes at many levels, until a complete top-level system is formed. This strategy often resembles a "seed" model, in which the beginnings are small but eventually grow in complexity and completeness.

The biggest challenge in developing a website is mastering its architecture, and the bottom-up approach makes this design task manageable. The bottom-up approach to developing a website involves the following steps:

Brainstorming: Brainstorm a list of items you would like to include in your application. For example, migrating one million documents from Documentum to FileNet P8 requires a definite flow of steps, and each step involves interaction between two or more bodies.

Grouping: The items listed in the brainstorming step are sorted into 4-5 categories, and each category is given a specific name. In Transportation Funding, for example, such groupings are folder structure extraction, folder migration, document migration, deletion of the corresponding document or folder from Documentum upon successful migration to FileNet P8, and the respective logging.

Critique the categories: Look at the categories with a more critical eye and refine them further.

Revise your categories: A final revision of the categories is done. Each category should match its name accurately and precisely, because this makes the formulation and implementation of the application successful.

Develop a flow-chart: Finalize the categories and the items they will contain. Once this is done, prepare a flow-chart. This flow-chart will become the roadmap for application design.

6.2 (Design Notations, Detailed Design, Flowcharts)

SDLC Project Life Cycle

[Figure: SDLC phases, in order: Feasibility Study, Requirement Analysis, Planning, Design, Coding, Testing, Implementation, Maintenance]

Data Flow Diagram (DFD)

A Data Flow Diagram (DFD) is a graphical representation of the "flow" of data through an information system. A data flow diagram can also be used to visualize data processing (structured design). It is common practice for a designer to first draw a context-level DFD, which shows the interaction between the system and outside entities. This context-level DFD is then "exploded" to show more detail of the system being modeled.

Data flow diagrams were invented by Larry Constantine, the original developer of structured design, based on Martin and Estrin's "data flow graph" model of computation. Data flow diagrams are one of the three essential perspectives of the Structured Systems Analysis and Design Method (SSADM).

With a data flow diagram, users are able to visualize how the system will operate, what the system will accomplish, and how the system will be implemented. Data flow diagrams can give the end user a physical idea of where the data they input ultimately has an effect on the structure of the whole system, from order to dispatch to restock. How any system is developed can be determined through a data flow diagram. A data flow diagram illustrates the processes, data stores, and external entities in a business or other system, and the data flows connecting them.

Components of Data Flow Diagram

The four components of a data flow diagram (DFD) are:

External Entities/Terminators are outside of the system being modeled. Terminators (also referred to as sources or sinks, depending on whether data flows from or into them) represent where information comes from or where it goes. In designing a system, we have no idea about what these terminators do or how they do it.

Processes modify the inputs in the course of generating the outputs.

Data Stores represent a place in the process where data comes to rest. A DFD does not say anything about the relative timing of the processes, so a data store might be a place to accumulate data over a year for the annual accounting process.

Data Flows are how data moves between terminators, processes, and data stores (those that cross the system boundary are known as IO or Input Output Descriptions).
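The four component kinds can be modeled as a small data structure. The following is only a sketch: the class and method names are hypothetical, the example components are invented for illustration, and the check encodes the guideline given in this section that a DFD page should hold fewer than 10 components.

```python
from dataclasses import dataclass, field

# The four component kinds named in the text.
KINDS = {"terminator", "process", "data_store", "data_flow"}


@dataclass
class Component:
    number: str  # hierarchical number, e.g. "1" or "1.2"
    kind: str    # one of KINDS
    name: str


@dataclass
class DfdPage:
    title: str
    components: list = field(default_factory=list)

    def add(self, number: str, kind: str, name: str) -> None:
        """Add a numbered component, rejecting unknown kinds."""
        if kind not in KINDS:
            raise ValueError(f"unknown component kind: {kind}")
        self.components.append(Component(number, kind, name))

    def needs_decomposition(self, limit: int = 10) -> bool:
        """True when the page no longer has fewer than `limit` components."""
        return len(self.components) >= limit


# Hypothetical context-level page for a file search system.
page = DfdPage("Context level")
page.add("1", "terminator", "User")
page.add("2", "process", "Search files")
page.add("3", "data_store", "File index")
page.add("4", "data_flow", "Search query")
print(page.needs_decomposition())  # False: only 4 components on this page
```

A page for which `needs_decomposition()` returns `True` would be a candidate for folding several components into one process and describing that process on a separate, more detailed DFD.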

Every page in a DFD should contain fewer than 10 components. If a page has more than that, several of its components should be combined into a single component (typically a process), and another DFD should be generated that describes that component in more detail. Each component should be numbered, as should each subcomponent, and so on.

Database Design

[Figure: migration utility diagram. Labels shown: Migration Utility, Java API (twice), DFC Libraries, WSI Listener, EJB Listener, WebSphere, Content Server (twice), File Store (twice), RDBMS (twice)]

6.3: Pseudo code

Table Structure

/* Table structure for table login */

CREATE TABLE login (
    user_name varchar(40) NOT NULL PRIMARY KEY,
    password varchar(20) NOT NULL
);

/* Table structure for table counter */

CREATE TABLE counter (
    openFile_name varchar(40) NOT NULL,
    count int(8) default NULL
);

/* Table structure for table upload */

CREATE TABLE upload (
    author varchar(40) default NULL,
    file_name varchar(1024) NOT NULL,
    upload_date datetime NOT NULL,
    file_size int(11) NOT NULL,
    file_type varchar(10) NOT NULL
);

/* Table structure for table uploadhtml */

CREATE TABLE uploadhtml (
    author varchar(40) default NULL,
    file_name varchar(100) NOT NULL,
    upload_date datetime NOT NULL,
    file_size int(11) NOT NULL,
    file_type varchar(10) NOT NULL,
    meta_content varchar(100) default NULL
);

/* Table structure for table view_search */

CREATE TABLE view_search (
    search_text varchar(40) default NULL,
    file_names varchar(200) default NULL,
    search_date datetime default NULL,
    viewed_files varchar(100) default NULL,
    boolean_search_type varchar(3) NOT NULL,
    search_type varchar(20) default NULL
);
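The report describes free-text search with AND and OR clauses, and the view_search table records a boolean_search_type for each query. As a rough sketch only, assuming that column holds the strings "AND" or "OR" (the report does not say) and that matching is a case-insensitive substring test on file names, the matching step might look like the following. The function name and the sample file names are hypothetical.

```python
def boolean_search(file_names, terms, search_type="AND"):
    """Return the file names that match all terms (AND) or any term (OR)."""
    mode = search_type.upper()
    if mode not in ("AND", "OR"):
        raise ValueError(f"unsupported boolean search type: {search_type}")
    if not terms:
        return []  # assumption: an empty query matches nothing

    combine = all if mode == "AND" else any
    lowered_terms = [term.lower() for term in terms]
    return [
        name
        for name in file_names
        if combine(term in name.lower() for term in lowered_terms)
    ]


# Hypothetical uploaded files.
files = ["java_notes.txt", "java_tutorial.html", "sql_notes.txt"]

print(boolean_search(files, ["java", "notes"], "AND"))  # ['java_notes.txt']
print(boolean_search(files, ["java", "sql"], "OR"))     # all three files
```

The three-character width of boolean_search_type would fit either value, which is the only basis for the assumption above.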

Conclusion & References


5.1 Conclusion

Scope of the Project

The Optimum Search Engine System software is being developed as an accurate and efficient system for the user. In this system, the record of each master detail is preserved along with the transactions related to it. The system is also secured, as all uploading and deletion can be done only by an authorized person. A proper problem analysis was done and a plan was prepared to automate the system, following the approach and methodology planned for the project.

Limitations

Although this project has many advantages in a live environment, it still faces some problems. These limitations do not affect the whole application, nor other existing systems.

The Optimum Search Engine System was developed to cover the major aspects of searching: adding users, free-text search with AND and OR clauses, search by file name and by file extension with priority, information about class / section, help, and FAQs. Using this application, one can search several types of files at a time. This saves the user time when searching records, but the application does not search records in a global environment, i.e. files placed on other servers.

The limitations of this project are numbered below:
1) It is not integrated with other existing modules like image search.
2) This application does not consider the searching of records in a global environment, i.e. files placed on other servers.

5.2 Future Scope of Project

Further scope

The further scope of the Optimum Search Engine System application is to add image search and a global search environment to enhance the software. It is highly likely that the scope will change as the web application moves forward, so the web process model should be incremental. This allows the development team to freeze the scope for one increment so that an operational web application can be released. The next increment may address scope changes suggested by a review of the preceding increment, but once the second increment commences, scope is again frozen temporarily. This approach enables the web application team to work without having to accommodate a continual stream of changes, while still recognizing the continuous evolution characteristic of most web applications. Besides that, the following basic qualities of the software safeguard its future scope.

Extensibility: The software can be extended in ways that its original developer may not expect. The following principles enhance extensibility: hide data structures, avoid traversing multiple links or methods, avoid case statements on object type, and distinguish public and private operations.

Portability: Since this is an internet-based application, its portability and usability depend on the client connected to the internet. Interface design, that is, web page design, is one of the major parts of a web application: it forms the first impression, and regardless of the value of its contents the interface must grab a potential user immediately.

5.3 BIBLIOGRAPHY

The books and search engines that I referred to during the development of this project were of great help to me. They assisted me a lot and gave me the confidence to perform such work. The names of some of these books are listed below:

1. The Complete Reference Java, Herbert Schildt (Tata McGraw-Hill)
2. Beginning Java 2, Ivor Horton (SPD)
3. Professional Java Programming, Brett Spell (SPD)
4. An Introduction to Database Systems, C.J. Date
5. Database System Concepts, Korth, Sudarshan
6. PL/SQL, Ivan Bayross
7. System Analysis and Design, Awad
8. Software Engineering, Roger S. Pressman, Ph.D. (McGraw-Hill)
