Q: 1 Which of the following open source framework manages processing and storing for Big Data application?
Hadoop
Struts
Spring
Ant
[ Option A ]
In Big Data applications, large volumes of data need to be stored and processed efficiently across multiple machines. For this purpose, specialized frameworks are used that support distributed computing.
Hadoop is an open-source framework designed specifically for distributed storage (HDFS) and parallel data processing (MapReduce), making it suitable for Big Data applications.
| Framework | Type | Description |
|---|---|---|
| Apache Hadoop | Big Data Framework | It is used to store and process large data using distributed systems. |
| Apache Struts | Web Framework | It is used to build Java-based web applications using MVC architecture. |
| Spring Framework | Application Framework | It is used to develop robust and scalable Java applications. |
| Apache Ant | Build Tool | It is used to compile, build, and manage Java projects automatically. |
Q: 2 Which of the following is used for storing unstructured data in Big—Data environment?
In-Memory Database
Relational Database
NoSQL Database
SQL Database
[ Option C ]
In a Big Data environment, data often exists in unstructured or semi-structured forms such as text, images, videos, social media data, and sensor logs.
NoSQL Databases are designed to handle such data because they provide flexible schemas, horizontal scalability, and distributed storage. Examples include MongoDB, Cassandra, and HBase.
Q: 3 What is the primary purpose of Hadoop Distributed File System (HDFS) in Big Data Storage?
To compress file
To store in-memory data
To store relational data
To store large files across multiple machines
[ Option D ]
The Hadoop Distributed File System (HDFS) is a distributed storage system designed to store very large datasets across multiple machines in a cluster. It divides large files into smaller blocks and distributes them across different nodes, providing fault tolerance, scalability, and high-throughput access to data.
Q: 4 Which of the following is not a characteristics of Big Data?
Velocity
Visualisation
Volume
Variety
[ Option B ]
Big Data is commonly characterized by the 3Vs: Volume, Velocity, and Variety.
Visualization is a technique used to present or analyze data graphically, but it is not one of the fundamental characteristics of Big Data.
Q: 5 Which of the following is a Big—Data tools employed for real-time stream processing?
Hadoop
Apache Flink
Splunk
ERDPlus
[ Option B ]
Apache Flink is a Big Data processing framework designed for real-time stream processing and high-throughput data analysis. It processes continuous streams of data with low latency, making it suitable for applications such as real-time analytics, event processing, and monitoring systems.
Q: 6
Match the following in the context of Big—Data:
| Group—I | Group—II |
|---|---|
| (i) Structured Data | (P) Text files, Images, Videos |
| (ii) Semi-Structured Data | (Q) Fixed Format Data |
| (iii) Unstructured Data | (R) XML, JSON |
(i)—(Q), (ii)—(R), (iii)—(P)
(i)—(Q), (ii)—(P), (iii)—(R)
(i)—(P), (ii)—(R), (iii)—(Q)
(i)—(R), (ii)—(Q), (iii)—(P)
[ Option A ]
In Big Data, data is commonly classified into structured, semi-structured, and unstructured data based on how it is organized and stored.
Thank you so much for taking the time to read my Computer Science MCQs section carefully. Your support and interest mean a lot, and I truly appreciate you being part of this journey. Stay connected for more insights and updates! If you'd like to explore more tutorials and insights, check out my YouTube channel.
Don’t forget to subscribe and stay connected for future updates.