Data Mining

Published on November 2016

Data Warehousing & Mining

Model Paper
Name of Subject : Data Warehousing & Mining

Q.1. A data warehouse is a collection of -
(1) Corporate Information
(2) Balanced Information
(3) Strategic Information
(4) None of the above
Q.2. “The logical link between what the managers see in their decision support EIS applications and
the company operational activities.” This statement describes -
(1) Data warehouse
(2) Database
(3) Data mining
(4) All of the above
Q.3. Data are organized according to ____________ instead of application.
(1) Object
(2) Time
(3) Subject
(4) All of above
Q.4. Data are not updated or changed in any way once they enter the data warehouse, but are only
loaded and accessed. This property of the data warehouse is -
(1) Subjective
(2) Integrated
(3) Non-volatile
(4) None of these
Q.5. The metadata should at the very least contain -
(1) The structure of data
(2) The algorithm used for summarization
(3) The mapping from the operational environment to the data warehouse
(4) All of above
Q.6. A database which is built for online transaction processing (OLTP) is generally regarded as
__________ for data warehousing, as it has been designed with a different set of needs in mind.
(1) Exactly right
(2) Not wrong
(3) Unsuitable
(4) None of above
Q.7. Data cleansing is an important aspect of creating an efficient data warehouse in that -
(1) The cleansing process has to remove duplication and reconcile differences.
(2) It is removal of certain aspects of operational data.
(3) All types of queries even those which may require low level information
(4) None of these
Q.8. ______________ power is another important aspect of __________
(1) Credibility, data warehouse
(2) Indicification, data mining
(3) Number crunching power, data warehouse
(4) All of above
Q.9. The sandwich paradigm advocates the following approach -
(1) Pre-mine the data to determine what formats and data are needed to support a data mining application.
(2) Design with a different set of needs in
(3) Maximization of transaction capacity
(4) All of above
Q.10. OLTP uses data that runs the business while the data warehouse uses data _______________
(1) That analyzes the business
(2) That revokes the business
(3) That schedules the time
(4) All of the above

Q.11. Data warehousing is the ___________
(1) process of extracting and transforming operational data into informational data and loading it into a central data store
(2) Current a/c balance for this customer
(3) Data which is accessible via desktop query and analysis tools by the decision makers.
(4) none of the above


Q.12. __________ is the final component of the data warehouse and is really of a different dimension in
that it is not the same as data drawn from the operational environment.
(1) Data mining
(2) Data warehouse
(3) Database
(4) All of above
Q.13. Criteria for a data warehouse -
(1) Load Performance
(2) Time complexity
(3) Data Quality Management, Query
(4) (1) and (3)
Q.14. The data warehouse can be used to -
(1) Bring better products to market in a more timely manner
(2) Understand the problem and maintenance of its database
(3) All of above
(4) Do the query processing
Q.15. Informational data is typically stored in a format that __________________
(1) makes analysis much easier
(2) makes database efficient
(3) makes all simple
(4) none of above
Q.16. _______________ analysis solutions, commonly referred to as ______________
(1) multi dimensional, OMSP
(2) Single dimensional, OFTP
(3) Multidimensional, OLTP
(4) None of above
Q.17. Data marts are workgroup or departmental warehouses, which are small in size -
(1) typically 5-5-gb
(2) typically 1-10gb
(3) typically 10-20gb
(4) None of above
Q.18. EIS is -
(1) which combine DSS tools
(2) which combine data mining
(3) which follows the rule of database
(4) All of above


Q.19. IBM’s approach to data warehousing is to provide solutions for building the warehouse and to
assist the decision making process through a set of products and services. This set of products
and services or components defines -
(1) PS/QL
(2) AS/300
(3) AS/400
(4) All of above
Q.20. DataPropagator Relational Capture and Apply for ___________ is an IBM solution for moving
data into the data warehouse.
(1) OS/400
(2) AL/PS
(3) both above
(4) None
Q.21. A data warehouse server contains performance, capacity, scalability, open interfaces, multiple
data sources. These are -
(1) Constraints
(2) Components
(3) Measurements
(4) All of the above

Q.22. Parallel I/O means that a single user can submit -
(1) A query operation against
(2) Multiple I/O processor
(3) both (1) and (2)
(4) None of these
Q.23. An AS/400 application can be accessed through the internet using -
(1) 5250/HTML workstation
(2) AS/400 components
(3) both (1) and (2)
(4) None of above.
Q.24. The AS/400 WWW server simplifies the accessibility of our data warehouse data to -
(1) Company customers and potential
(2) Client access product
(3) both (1) and (2)
(4) None
Q.25. The IBM Client Series program defines -
(1) A selected set of leading edge products to help you through selection process.
(2) Analysis of the data warehouse from the desktop
(3) Offering a wide range of end-user products to choose
(4) None of these
Q.26. The technique which places records into groups with similar characteristics -
(1) Classification
(2) Clustering
(3) Partitioning
(4) None of these
Q.27. This can be found in database catalogs, data-dictionaries and repositories -
(1) Metadata
(2) Intelligent Mines
(3) Prediction
(4) None
Q.28. The features query, report writing, graphics are contained by the product -
(1) Data guide
(2) Visualizer
(3) AMIS/400
(4) None
Q.29. As a data professional you need to establish _____________ that allows for the maximum reuse of
data as it moves through its life cycle.
(1) complex environment
(2) data management architecture
(3) database management system
(4) none of the above
Q.30. Which one is not an operational system?
(1) order processing
(2) inventory
(3) general ledger
(4) None of these
Q.31. The term backend describes -
(1) the data repository used to support the data warehouse, equipped with the software that supports the repository
(2) the tools used by the warehouse end users to support their decision making activities.
(3) both (1) and (2)
(4) None

Q.32. DSS is sometimes synonymous with the ____________
(1) EIS
(2) Data warehouse
(3) MIS
(4) None
Q.33. The two most important cogs in the data warehouse team machinery are the end user and the ____________
(1) Executive
(2) DSS analyst
(3) Project leader
(4) None
Q.34. Efforts to translate vast amounts of data into useful knowledge are an exercise in ____________
(1) DSS
(2) Data warehouse
(3) KDD
(4) None
Q.35. The process of studying data and using the content to discover trends and patterns is called _____
(1) data warehousing
(2) data mining
(3) database management systems
(4) None of these
Q.36. Among the four key components of a data warehouse, one is -
(1) Organizational structure
(2) Change management
(3) approve method
(4) all of above
Q.37. There are ___________ layers of _________ within the applications.
(1) 8; data dictionary
(2) 7; metadata
(3) 6; database
(4) None of above
Q.38. Data provision specialists must not only be diplomats but also be __________
(1) Eloquent communicators
(2) Politically savvy key people
(3) Knowledge person
(4) None of above
Q.39. Who is responsible for promoting the vision of the data warehouse?
(1) project director
(2) project manager
(3) Chief executive
(4) None of above
Q.40. Who is responsible for data security of the project?
(1) project manager
(2) database administrator
(3) architect
(4) none of these
Q.41. “If you don’t see it in here, don’t assume it’s being done” is true for -
(1) WBS
(2) MIS
(3) data warehouse
(4) none
Q.42. The project life cycle follows the process -
(1) ultimate design execute
(2) initiate, plan, execute, close
(3) plan, analyze, execute
(4) none of the above
Q.43. Effective project management for a data warehouse includes a _____________
(1) focus on risk management, communication planning and expectations management
(2) essential and requires the vision to see the strategic picture while being attentive to the technical
(3) both (1) and (2)
(4) None
Q.44. The integrated plan for a project is not used to -
(1) monitor and control the project.
(2) Combine various phases involved
(3) cut costs
(4) none of these
Q.45. Project estimation is done to know -
(1) time and efforts it will require
(2) cost of project
(3) funding requirements
(4) all of these
Q.46. A project pilot must be limited to -
(1) one subject area
(2) any number of subjects
(3) maximum 5 subjects
(4) there is no such restriction.
Q.47. In risk management, when you do get caught, this is typically due to one of these situations -
(1) constraints
(2) unknowns
(3) assumptions
(4) all of above
Q.48. A technical editor must be selected before ___________
(1) the editing cycle of chapter can be
(2) each unique path (A,B,C) of activities is represented based on its dependencies
(3) both (1) and (2)
(4) none of above
Q.49. ____________ is a measurement of the number of names in that table.
(1) Cardinality
(2) ordinal number
(3) variance
(4) none of these
Q.50. A table that records cities and zip codes in the United States may have __________ names
(1) 123,100
(2) 123,000
(3) 120,500
(4) none of above
Q.51. The major advantage of bitmapped indexes is -
(1) space saving
(2) high speed
(3) ease of use
(4) none of these
Q.52. The ________ is a common method used to store data in the warehouse.
(1) ring form
(2) star form
(3) query result
(4) none of above
Q.53. The blank box is for -
(1) shipping platform
(2) shipping source
(3) shave selurna
(4) all of above

Q.54. ___________ indexes deliver the most cost effective solution for the broadest range of queries.
(1) AS/400

Q.55. __________ kernel is the heart of the software.
(3) HP-UX
(4) None of these

(1) MDD

(4) None of these
Q.56. If the result sets of join processing were empty, it will result in -
(1) Error
(2) the message "no records selected"
(3) Repeat process
(4) None of these
Q.57. A read only table space must be
(1) online
(2) have active transactions
(3) have pending transactions
(4) none of these
Q.58. Multi-CPU machines fall into 2 primary classes: symmetric multiprocessors and _____________
(1) massively parallel processors
(2) 486dx-2 and Pentium
(3) parallel-max-servers
(4) none of these
Q.59. Partitioning of data refers to -
(1) splitting data from a large table among a number of smaller tables.
(2) Using a fixed volume of data at one point of time.
(3) indexing of data
(4) none of these
Q.60. Oracle records every transaction in a file it calls ______________
(1) the log
(2) the new log
(3) the OLTP
(4) none of these
Q.61. A temporal component exists in ____________
(1) parallel behavior
(2) pattern discovery
(3) data mining
(4) none of these
Q.62. Data _________ is defined as a process of centralized data management and retrieval.
(1) mining
(2) warehousing
(3) both
(4) none
Q.63. Optimization techniques that use processes such as genetic combination, mutation and natural
selection in a design based on the concepts of natural evolution -
(1) genetic algorithms
(2) decision trees
(3) both
(4) none
Q.64. _________ refers to the severity of error and degree of noise in the data.
(1) irrelevant field
(2) uncertainty
(3) potential applications
(4) none of above
Q.65. The basic result of a data mining effort is -
(1) information
(2) operational efficiency
(3) databases
(4) none
Q.66. _________ helps transform vast amounts of data into information.
(1) data mining
(2) data warehousing
(3) both (1) and (2)
(4) none
Q.67. ____________ are the processes of creating a partition so that all the members of
each set of the partition are similar according to some metric.
(1) clustering
(2) segmentation
(3) partitioning
(4) all of above
Q.68. Data mining efforts result in -
(1) faster decision making
(2) right information for management
(3) more credibility of decisions
(4) all of these
Q.69. Clustering involves -
(1) clumping similar sets of data together from a larger and more massive data set
(2) discovers the groupings as it works with the input data
(3) based on the characteristics of members of each cluster
(4) all of above
Q.70. __________ use a set of processing elements (or nodes) analogous to neurons in the brain.
(1) neural networks
(2) risk management
(3) sales forecasting
(4) none of these
Q.71. __________ gives organizations the opportunity to deploy specialized servers which are optimized
for handling specific data management problems.
(1) online analytical processing
(2) client / server architecture
(3) multidimensional
(4) all of above
Q.72. _________ applications are quite different from OLTP.
(1) SMTP
(2) OLAP
(3) both of these
(4) all of above
Q.73. __________ servers have the means for storing multidimensional data in a compressed form.
(1) OLAP
(2) SMTP
(3) both (1) and (2)
(4) all of above
Q.74. Data marts are essentially
(1) long term solutions
(2) short term solutions
(3) permanent
(4) none of these
Q.75. The world discovered that a _________ environment without a data warehouse was an extremely
unsatisfactory thing.
(1) DSS environment
(2) Data mart
(3) data warehouse
(4) none of these
Q.76. Data warehouses are -
(1) arranged around the corporate subject areas found in the corporate data model.
(2) The structure and the content of data that resides in a data warehouse and the structure and the content of data that resides in a data mart.
(3) both of these
(4) none of above
Q.77. Transformation is a process -
(1) to conform data in a standard system
(2) to provide solutions to queries by
(3) moves the data into data mart
(4) none of these
Q.78. For a product, which one is not a dimension?
(1) Price
(2) time of sale
(3) category
(4) manufacturing process
Q.79. If the data sets are small and unidimensional, the preferred architecture is -
(3) any of these
(4) none of above
Q.80. OLAP tools can perform these functions -
(1) data synthesis
(2) data analysis
(3) answer queries
(4) all of above
