Research
Research towards my PhD
Research in Wireless Networks
Internship Projects
Course Projects
Research towards my PhD
(
Advisors: Prof. Divyakant Agrawal and Prof. Amr El Abbadi, @ UCSB )
Elastic Data Management in the Cloud
With the launch of Amazon Web Services (AWS), the Utility (or Cloud) Computing paradigm has become extremely popular. The pay-as-you-go model, and the Elastic computing model has been extremely successful for both small and medium sized corporations. As a result, many similar services have come up since the inception of AWS, salesforce.com, RackSpace, Microsoft Azure, Google AppEngine etc to name a few. All these services scale the entire spectrum from Insfrastructure as a Service (IaaS) to Platform as a Service (PaaS). In the high end services like Google AppEngine, the scalability is automatically taken care of by Google, but with raw services such as EC2 and S3, which provides a huge amount of elastic compute power and storage capability, comes the challenge of automatic scaling of resources based on request patterns. Companies like RightScale Inc. provide auto scaling for the compute cycles, but the back-end database still remains the bottle-neck for many data driven web-applications. In this project, we are looking towards scaling the data management insfrastructure in the cloud. From our understanding, there has been two types of workloads that are predominant for web-applications:
- Single Key Workloads: Which is the most predominant workload requiring very high scalability, and this had led to the development and widespread success of large non-relational key-value stores such as Bigtable or Dynamo.
-
Traditional Transactional Workloads: This is not a significant fraction of the workload for web-applications, but still, many applications do require relational databases, and transactional guarantees. But the scalability requirements of these applications are not as demanding as the ones for the single key access patterns.
In addition, there has been a recent trend of Web 2.0 applications that require Scalable Transactional Access to Large Number of Small Groups of keys. Each of these applications require transactional access to more than a single key, the keys of keys are small, but the number of applications can be pretty large.
The Single Key Model has seen many successful and mature solutions, but the two latter models are still a mystery somewhere in the cloud. And this is the goal of this research project. Our focus is to develop a system which will act as the Data Management Layer in the Cloud, supporting the different kinds of workloads noted above. Even though from the application's perspective, the data management layer can have a uniform API, but the system data management system can itself be built on heterogenous components.
We are still in the process of designing the system, more details will be posted as the design materializes.
Hide Abstract
Related Publications:
- Sudipto Das, Divyakant Agrawal, Amr El Abbadi, "ElasTraS: An Elastic Transactional Data Store in the Cloud", [Paper] USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '09), in conjunction with USENIX '09.
- Sudipto Das, Shyam Antony, Divyakant Agrawal, Amr El Abbadi, "Clouded Data: Comprehending Scalable Data Management Systems", [Paper] UCSB CS Tecnical Report 2008-18, Nov. 2008.
Towards Parallelizing Data Stream Operators
Recent years have seen the increase in popularity in applications where the data is processed as a stream of tuples. These applications are known as Data Stream applications and the algorithms processing these streams need to provides answers on-line. Most of the existing work in stream processing are sequential in nature. With the advent of multi-core processors and their ubiquitous presence, the research community is striving for parallel algorithms to effectively utilize the parallelism of the modern processor paradigms. This project is aimed at designing parallel algorithms for data stream applications.
Hide Abstract
Related Publications:
- Sudipto Das, Shyam Antony, Divyakant Agrawal, Amr El Abbadi, "Thread Cooperation in Multicore Architectures for Frequency Counting over Multiple Data Streams", [Paper] in the 35th International Conference on Very Large Data Bases (VLDB) 2009.
- Sudipto Das, Shyam Antony, Divyakant Agrawal, Amr El Abbadi, "Thread Cooperation in Multicore Architectures for Frequency Counting Over Multiple Data Streams", [Paper] UCSB CS Tecnical Report 2009-04, March 2009.
- Sudipto Das, Shyam Antony, Divyakant Agrawal, Amr El Abbadi, "CoTS: A Scalable Framework for Parallelizing Frequency Counting over Data Streams", [Paper] In Proc of ICDE 2009, Shanghai, China.
- Sudipto Das, Shyam Antony, Divyakant Agrawal, Amr El Abbadi, "CoTS: A Scalable Framework for Parallelizing Frequency Counting over Data Streams", [Paper] UCSB CS Tecnical Report 2008-08, June 2008.
TCAM Conscious Algorithms for Frequent Element Queries
In this project, I am interested in designing TCAM conscious algorithms for answering frequent element and top-k queries over data streams. TCAM or Ternary Content Addressable Memory is characterized by constant time look-up operations in hardware. The current research is focussed on designing an efficient algorithm that can efficiently leverage this look-up property of the TCAMs.
Hide Abstract
Related Publications:
Anonymizing Weighted Social Network Graphs
The increasing popularity of social networks has initiated a fertile research area in information extraction and data mining. Although such analysis can facilitate better understanding of sociological, behavioral, and other interesting phenomena, there is growing concern about personal privacy being breached, thereby requiring effective anonymization techniques. If we consider the social graph to be a weighted graph, then the problem of anonymization can be of various types: node identity anonymization, structural anonymization, or edge weight anonymization. In this paper, we consider edge weight anonymization. Our approach builds a linear programming (LP) model which preserves properties of the graph that are expressible as linear functions of the edge weights. Such properties form the foundations of many important graph-theoretic algorithms such as single source shortest paths tree, all-pairs shortest paths, k-nearest neighbors, minimum cost spanning tree, etc. Off-the-shelf LP solvers can then be used to find solutions to the resulting model where the computed solution forms the weights of the anonymized graph. As a proof of concept, we choose the shortest paths problem and its extensions, prove the correctness of the constructed models, analyze their complexity, and experimentally evaluate the proposed techniques using real social network data sets. Our experiments demonstrate that not only does the proposed technique anonymize the weights, but it also improves the k-anonymity of the graphs while scrambling the relative ordering of the edge-weights, thereby providing robust and effective anonymization of the sensitive edge-weights.
Hide Abstract
Related Publications:
Top of Page
Research in Wireless Networks
ReTiMon: A Real Time Network Monitor
This project involves designing a tool that would help monitor a wireless network in real time.
This is an important problem as with the increasing popularity of the wireless networks and
their large scale deployment, managing such networks has become a challenging task. To be particular,
if there are problems in operation of these networks, then diagnosing them and troubleshooting them manually
has become a big problem. Most of the existing methods involve off-line analysis of the network performance,
but that is of little help when it comes to dealing with the problems in real-time. Real-time analysis is required
to ensure reliable operation of the network and guarantee user satisfaction.
We have designed a tool, that would help the users/ network adminitrators to analyse the network in real time.
This tool consists of a sniffer, that passively monitors the network, and a GUI that displays the various
metrics of the network as observed by the sniffer. Details of the project can be found in the technical report.
Hide Abstract
Source Tree:
The Source tree for the Visualizer can be found here
To run the client, the command is java client.MonitorClient [ip addr of the server].
To run the server, the command is java server.MainFrame
Technical Report
QoS Routing in Wireless Mesh Networks
Wireless Mesh Networks are an emerging field of research in the recent years. But very little has been done for providing QoS
enabled routing in Wireless Mesh Networks. This work consists of designing a routing protocol for Wireless Mesh Networks that
provides "strong" QoS guarantees in terms of Bandwidth and Delay. This protocol determines the routes on demand
and selects a route that can strictly comply with the bandwidth and delay requirements. The protocol takes the overall robustness
of the routes when selecting a route amongst multiple candidate routes.
Hide Abstract
Related Publications:
- Vinod Kone, Sudipto Das, Ben Y. Zhao, Heather Zheng, "QUORUM - Quality of Service in Wireless Mesh Networks", [Paper], In Mobile Networks and Applications Journal, Springer, Netherlands.
- Vinod Kone, Sudipto Das, Ben Y. Zhao, Heather Zheng, "QUORUM - QUality Of service RoUting in wireless Mesh networks", [Paper], In Proc. of QShine 2007, Vancouver, Canada.
Improvement of the performance of Ad-hoc routing
Protocols
Various Ad-hoc
routing protocol implementations suffer form certain implementation problems
like Communication Gray Zone, Fluctuating Neighbors etc. We hope to provide a
pragmatic approach towards improving the performance of the routing protocols in
this respect. A new module has been implemented that keeps track of the
stability of the neighbors to make route decisions and is also capable of
detecting the presence of new nodes in its neighborhood, even if they are not
participating in any data transfer, and explore possible routes through it. We
are also exploring the possible applications of this new module in Multipath
routing. In addition to obtaining the stability information, it gathers other
neighbor statistics to help the routing entity choose the best route from a set
of available routes.
Hide Abstract
Related Publications:
- Sudipto Das, Rajesh Roy, Pradip K. Das, "Optimizations to Multipath Routing Protocols in Mobile Ad hoc Networks", [Paper], In Proc. of International Conference on Emerging Applications of IT Feb 2006, Kolkata, India.
- Rajesh Roy, Sudipto Das, Pradip K. Das, "A Pragmatic Approach towards the Improvement of Performance of Ad Hoc Routing Protocols", [Paper], In Proc. of 4th Asian International Mobile Computing Conference, Jan 2006, Kolkata, India (AMOC 2006).
Improvement of the performance of TCP over Mixed Wireless
(Wired-cum-Wireless) Networks
In this project
we are working on improving the throughput / goodput of the TCP sources. We are
exploring a change in the congestion avoidance algorithm of the traditional
TCP-Reno flavor by using a constant congestion window to compensate for the
packets lost in wireless losses, as well as congestion losses. The entire
experimental work is mainly simulation based; all simulations are performed
using the network simulator ns-2
(VINT Project of UCB, LBL & Xerox PARC). We are also exploring the impact of
this modified congestion control algorithm on other aspects of TCP performance
such as Fairness.
Hide Abstract
Related Publications:
- Anup K. Ghosh, Sudipto Das, Rajesh Roy, Amitava Mukherjee, "Sender Side Intelligence for TCP Throughput Enhancement in Wired-cum-Wireless Networks", [Paper], In Proc. of International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC.07), September 2007, Athens, Greece.
- Rajesh Roy, Sudipto Das, Anup K. Ghosh, Amitava Mukherjee, "Modified TCP Congestion Control Algorithm for Throughput Enhancement in Wired-cum-Wireless Networks", [Paper], In Proc. of 4th Swedish National Computer Networking Workshop, Oct 2006, Sweden (SNCNW 2006).
- Anup K. Ghosh, Sudipto Das, Rajesh Roy, Amitava Mukherjee, "Constant Congestion Window approach for TCP - effect on Fairness", [Paper], In Proc. of 3rd Swedish National Computer Networking Workshop, Sep 2005, Halmstad, Sweden (SNCNW 2005).
Implementation of a Multi-Hop Ad-Hoc network at the
Jadavpur University Campus
In this project,
as a group of 2, we had set up a Multi- Hop Ad-Hoc network. As of now the
network connects the two sites of the Centre for
Mobile Communication & Computing (CMCC) Lab within the campus, thereby
creating a Wireless corridor. Gradually, it would be expanded outside the
campus, with many added functionalities. We had also developed a GUI based
Client-Server application that would help monitor the status and performance
of the other nodes currently present in the network.
Hide Abstract
Deployment of a Wireless Distribution System at the Jadavpur
University Campus
In this project,
as a group of 2, we deployed a Wireless Distribution System (WDS), in which an
isolated Access Point (AP) "not connected to the infrastructural network"
connects to another AP which is connected to the Wired Network to provide
services to its users. Hence a Distribution System over wireless is set up and
the remotely placed AP can deliver services, even without any link to the wired
network.In IEEE 802.11
terminology a "Distribution System" is system that interconnects so-called Basic
Service Sets (BSS). A BSS is best compared to a "cell", driven by a single
Access Point (one of those circles in the diagram below). So a "Distribution
System" connects cells in order to build a premise wide network which allows
users of mobile equipment to roam and stay connected to the available network
resources. A distribution system can Wired (typically Ethernet), or Wireless
(using the radio device inside the Access Point). One important aspect of WDS
(this in contrast to other existing wireless AP to AP connection schemes used in
for instance outdoor installations) is the fact that a single PC card in the
Access Point can assume multiple roles at the same time. It can "drive" a cell
(as in wired connected APs), and as such connects wireless clients to the
infrastructure, and it can maintain up to six different wireless connections to
other Access Points. For that to be possible the operational (frequency) channel
will need to be the same for the cell that is controlled by the AP and for the
wireless links to the other APs.
Hide Abstract
Technical
Report
Top of Page
Internship Projects
Intern @ IBM Almaden Research Labs, Almaden
(
Summer 2009 )
Intern @ Google, Mountain View
(
Summer 2007 )
Intern @ IBM Global Services, India
(
Summer 2005 )
Integrating SIP with SAP
Enterprise Portals
This work was
carried out during the summer training at IGSI, Kolkata in which I had performed
some research & developmental work towards integrating SIP phones (based on
Session Initiation Protocol) with SAP enterprise portal. Our goal was to
integrate a SIP phone with the other collaboration features of SAP Enterprise
portals. This would equip the SAP portals to make or receive phone calls. The main objective
of the project was to design a portal component that can be integrated with the
SAP Collaboration Features in order to make a call from a computer to a mobile
or basic landline phone without using PSTN. The goal is to use the Session
Initiation Protocol (SIP) for this purpose. I had tried to incorporate a SIP
phone as one of the SAP Collaboration Room features. SAP is one of the most
popular ERPs - short for enterprise resource planning,
a business management system that integrates all facets of the business.
Hide Abstract
Top of Page
Course Projects
Ruby under Scanner: Comparison with Java
(
Technical Report )
Graph Media: a multi featured Software for Graphical Analysis of Mathematical Functions
DFD Generator: a Java based tool the helps the user draw Data Flow Diagrams.
Top of Page