Kommunikationsnetze / Multimedia Kommunikation – TU Darmstadt

Pratyush Agnihotri, M.Sc.

pratyush.agnihotri@kom.tu-...

0000-0001-9944-926X

Private Questions

Short Introduction

Pratyush Agnihotri joined the Master's course in Computer Science at TU Darmstadt in 2013. In his master thesis work, he proposed and developed a decentralized crowd-sensing data collection framework using Information-Centric Networking (ICN) concepts to deal with post-disaster challenges in communication. The thesis was done in joint collaboration with Prof. Dr.-Ing. Michael Zink, University of Massachusetts at Amherst, USA. After his graduation, he worked as a software developer at axxessio GmbH, Darmstadt in the application and research areas of Internet of Things (IoT), Smart City, Voice assistant. Since June 2021, he is working as a Research Scientist at the Multimedia Communication Lab and primarily associated with C2 subproject of DFG MAKI.

Research Interests

Distributed Parallel Stream Processing
Distributed Machine Learning
Autonomous Resource Management
Information-centric networking
Software-defined networking

Current Projects

DFG: Collaborative Research Center 1053 MAKI: I am working as Researcher for MAKI Subproject C2 which explores transitions in communication systems from an information-centric view. A central paradigm for information processing are event-based systems (EBS), which model the information flow as streams of events and help to recognize certain patterns over the event streams at runtime. The goal of C2 is to explore methods for transition-based adaptation of event-processing systems and thereby achieve a significant increase in the quality of service under dynamically varying conditions.

Parrot: Privacy Engineering for Real-Time Analytics in Human-Centered Internet of Things: I am working as Associated Researcher for Parrot project which is a collaborative research between the University of Oslo (Norway) and the Technical University of Darmstadt (Germany) funded by the Research Council of Norway (2020 – 2023). The aim of this project is to find user-centric methods for real-time privacy protection for stream processing applications.

Open Thesis/HiWi

I am looking for motivated students who are interested to work on cutting edge technologies and IoT applications area. If you are interested to write a Bachelor or Master Thesis or gain experience in MAKI project as HiWi then please feel free to contact me. Send your CV and Transcript or TUCaN grade list on my email address. Check open theses here.

Teaching Activities

semester	Course	role
Winter semester 2023/24	Communication network 2	Teaching assistant
Winter semester 2022/23	Communication network 2	Teaching assistant
Winter semester 2021/22	Communication network 2	Teaching assistant

Supervised Theses

student	topic	Thesis Type	semester	status
Shubham Sumalya	Benchmarking Parallel Stream Processing	Master	WS2023	Ongoing
Paul Stiegele	Learned Parallel Stream Processing	Master	WS2022	Completed
Chengbo Zhou	Cooperative ML-based NIDS between the data and control plane in SDN	Master	WS2022	Completed
William Laarakkers	Scalable and Efficient Processing of Financial Data using CEP	Bachelor (external RUG)	SS2022	Completed
Caroline Braams	Securing publish/subscribe in software-defined networks	Master (external RUG)	WS2021	Completed

Supervised Seminars/Labs/Projects

Students	topic	Type	semester	status
Rhivu	Auto-Scaling for Distributed Stream Processing using RL	MMC Lab	WS2324	Completed
Hamna (TU Ilmenau)	Efficient Memory Management for Large Language Model Serving with PagedAttention	DSOS Seminar	WS2324	Completed
Daniel (TU Ilmenau)	gSampler: General and Efficient GPU-based Graph Sampling for Graph Learning	DSOS Seminar	WS2324	Completed
Daniil (TU Ilmenau)	Parallel Stream Processing in Distributed Stream Processing Systems	DSOS Seminar	SS2023	Completed
Mouayad	Parallelizing Intra-Window Join using Hardware Accelerator	MMC seminar	SS2023	Completed
Mouaz, Jinyao, Zongdi	Parallel Stream Processing and Data Analytics using Flink	MMC Lab	SS2023	Completed
Chippy, Nishchay	Parallel Stream Processing using Reinforcement Learning	MMC seminar	WS2223	Completed
Mouayad, Ozcan	Real-time query validation and performance visualization	MMC Lab	WS2223	Completed
Sravya, Gayatri, Harsha	Big Data Analysis and Performance Evaluation	MMC Lab	WS2223	Completed
Aastha, Sushimitha	Learned Operator Parallelization and Stream Processing	MMC seminar	SS2022	Completed
Chengbo, Xiangyu, Xingzhou	Network Security Monitoring in SDN	MMC Lab	SS2022	Completed
Anton	Performance Evaluation of Windows Join in Stream Processing System	MMC Project	SS2022	Completed
Sushimitha, Christian	Query processing and performance analysis using Flink	MMC Lab	SS2022	Completed
Marvin	Hardware Accelerated Stream Processing	MMC seminar	WS2021	Completed
Ali	Resources Elasticity and Computation Offloading Techniques in Stream Processing Systems	MMC seminar	WS2021	Completed
corinna	Telcaria Alviu: Network Traffic Monitoring and Profiling	MMC Lab	WS2021	Completed

Proseminar ETiT

Proseminar Electrical Engineering and Information Technology
TUCaN course ID: 18-sm-1000-ps
Organization: Pratyush Agnihotri

Offene Abschlussarbeiten

Supervisor: Pratyush Agnihotri

Link zur Ausschreibung

Motivation:

Internet of Things describes a new world of heterogeneous objects such as sensors, smartphones, actuators etc, which intelligently interacts and communicate with each other and offer a better quality of life. With the increase of these objects, the data volume is growing as well. Processing this volume of data can be helpful to various application domains to reveal complex patterns, gain fast insights or react to observed situations, e.g., fraud detection, smart factories, telecommunication, etc. However, it becomes extremely challenging to continuously process this high volume of data in a timely manner and meet these applications' demands. One of the possible ways can be to use heterogeneous resources' capability such as GPUs, FPGAs, etc. to accelerate the big data analytics instead of relying on homogeneous resources, e.g., CPU.

Task:

Analyze existing related literature.
Formalizing the problem and developing the solution and
Setting up development environment and implementing the proof of concept or prototype.
Evaluation of the proposed solution in comparison to the existing approaches. .

Prerequisites:

There are no hard guidelines for this topic. However, it is preferable if you have:

Good programming skills in Java and/or Scala.
Understanding of the concept of .

communication networks and .
big data processing engines, e.g., Apache Flink, Kafka.

Contact:

I am looking for motivated students interested in working on cutting-edge technologies and IoT applications area. If you are interested in writing a Bachelor's or Master's Thesis or gaining experience in MAKI project as HiWi, please feel free to contact me. You can email me the following information:

Your CV or small text about your courses and prior experiences in this area.
The short note on your motivation about this topic and,
Transcript or TUCaN grade list.

Supervisor: Pratyush Agnihotri

Link zur Ausschreibung

Motivation

In Distributed Stream Processing Systems (DSPS), queries are usually long-running. They typically deal with a very high workload of million or even billion of events per second. Under such scenarios, parallelism plays an important role in providing scalability for DSPS. One of the core
decisions here is to determine the right parallelization degree to process a high load of events and queries while ensuring high throughput and low latency requirements. In this context, parallelization can be controlled by defining the degree of parallelism for each operator in the operator graph.
The main questions we want to investigate here are -

What is the right parallelism degree for query processing? (what is the performance in terms of end-to-end latency for parallel stream processing? At what extend the model can be used - seen and unseen behaviour? What are the limitations of models? At which can be extended?)
How to parallelize the processing of DSPs operators?

Prerequisites

There are no hard guidelines for this topic. However, it is preferable if you have:

Good programming skills in Java and Python
Good understanding of the concepts of machine learning
Understanding of the concept of communication networks, big data processing engines, e.g., Apache Flink, Kafka.

Reference Literature

Three steps is all you need: fast, accurate, automatic scaling decisions for distributed streaming dataflows.
A Comprehensive Survey on Parallelization and Elasticity in Stream Processing
Resource Management and Scheduling in Distributed Stream Processing Systems: A Taxonomy, Review, and Future Directions
One Model to Rule them All: Towards Zero-Shot Learning for Databases Benjamin
Apache Flink documentation: nightlies.apache.org/flink/flink-docs-master/docs/learn-flink/overview/ [https://nightlies.apache.org/flink/flink-docsmaster/docs/learn-flink/overview/]

Contact:

Your CV or small text about your courses and prior experiences in this area.
The short note on your motivation about this topic and,
Transcript or TUCaN grade list.

Supervisor: Pratyush Agnihotri

Link zur Ausschreibung

Distributed Stream Processing Systems (DSPS) allow real-time processing of data to fetch meaningful information. Big giants like Twitter, Facebook, and Alibaba rely on DSP for real-time data analytics. For example, click analytics, credit card fraud detection, etc. In DSPS, queries are usually long-running. They typically deal with a very high workload of millions or even billions of events per second. Under such scenarios, parallelism plays an important role in providing scalability for DSPS where either the input stream is partitioned into multiple processes or sub-streams or duplicated to be processed by operators and their instances. However, existing DSPS lacks the benchmarking of the parallel processing capability of operators under various workloads, event rates, and different queries.

Research Goals:

Literature study of benchmarking parallel stream processing.
Implementing Window and Pane-based parallelization strategy.
Evaluating the performance for various benchmarks
Displaying the performance evaluation result on the front end.
Possible DSPS to be used: Apache Flink

Veröffentlichungen

Alle Veröffentlichungen anzeigen

Pratyush Agnihotri

Pratyush Agnihotri, M.Sc.

Short Introduction

Research Interests

Current Projects

Open Thesis/HiWi

Teaching Activities

Supervised Theses

Supervised Seminars/Labs/Projects

Proseminar ETiT

Offene Abschlussarbeiten

Accelerating Big Data Analytics using Heterogeneous Resources
Bachelor/Master Thesis

Motivation:

Task:

Prerequisites:

Contact:

Learned Parallel Stream Processing using Zero-Shot Cost Models
Bachelor/Master Thesis

Motivation

Prerequisites

Reference Literature

Contact:

Benchmarking Parallel and Heterogeneous stream data processing systems
Bachelor/Master Thesis

Veröffentlichungen

Pratyush Agnihotri, M.Sc.

Short Introduction

Research Interests

Current Projects

Open Thesis/HiWi

Teaching Activities

Supervised Theses

Supervised Seminars/Labs/Projects

Proseminar ETiT

Offene Abschlussarbeiten

Accelerating Big Data Analytics using Heterogeneous Resources Bachelor/Master Thesis

Motivation:

Task:

Prerequisites:

Contact:

Learned Parallel Stream Processing using Zero-Shot Cost Models Bachelor/Master Thesis

Motivation

Prerequisites

Reference Literature

Contact:

Benchmarking Parallel and Heterogeneous stream data processing systems Bachelor/Master Thesis

Veröffentlichungen

Accelerating Big Data Analytics using Heterogeneous Resources
Bachelor/Master Thesis

Learned Parallel Stream Processing using Zero-Shot Cost Models
Bachelor/Master Thesis

Benchmarking Parallel and Heterogeneous stream data processing systems
Bachelor/Master Thesis