Pratyush Agnihotri, M.Sc.

Pratyush Agnihotri

Short Introduction

Pratyush Agnihotri joined the Master's course in Computer Science at TU Darmstadt in 2013. In his master thesis work, he proposed and developed a decentralized crowd-sensing data collection framework using Information-Centric Networking (ICN) concepts to deal with post-disaster challenges in communication. The thesis was done in joint collaboration with Prof. Dr.-Ing. Michael Zink, University of Massachusetts at Amherst, USA. After his graduation, he worked as a software developer at axxessio GmbH, Darmstadt in the application and research areas of Internet of Things (IoT), Smart City, Voice assistant. Since June 2021, he is working as a Research Scientist at the Multimedia Communication Lab and primarily associated with  C2 subproject of DFG MAKI.

 

Research Interests

  • Distributed Parallel Stream Processing
  • Distributed Machine Learning 
  • Autonomous Resource Management
  • Information-centric networking
  • Software-defined networking

 

Current Projects

DFG: Collaborative Research Center 1053 MAKI:  I am working as Researcher for MAKI Subproject C2 which explores transitions in communication systems from an information-centric view. A central paradigm for information processing are event-based systems (EBS), which model the information flow as streams of events and help to recognize certain patterns over the event streams at runtime. The goal of C2 is to explore methods for transition-based adaptation of event-processing systems and thereby achieve a significant increase in the quality of service under dynamically varying conditions.

Parrot: Privacy Engineering for Real-Time Analytics in Human-Centered Internet of Things:  I am working as Associated Researcher for Parrot project which is a collaborative research between the University of Oslo (Norway) and the Technical University of Darmstadt (Germany) funded by the Research Council of Norway (2020 – 2023). The aim of this project is to find user-centric methods for real-time privacy protection for stream processing applications.

 


Open Thesis/HiWi

I am looking for motivated students who are interested to work on cutting edge technologies and IoT applications area. If you are interested to write a Bachelor or Master Thesis or gain experience in MAKI project as HiWi then please feel free to contact me. Send your CV and Transcript or TUCaN grade list on my email address. Check open theses here. 

 


Teaching Activities

 
semester Course role
Winter semester 2023/24 Communication network 2 Teaching assistant
Winter semester 2022/23 Communication network 2 Teaching assistant
Winter semester 2021/22 Communication network 2 Teaching assistant

 

Supervised Theses

 
 student topic Thesis Type semester status
Shubham Sumalya Benchmarking Parallel Stream Processing Master WS2023 Ongoing
Paul Stiegele Learned Parallel Stream Processing Master WS2022 Completed
Chengbo Zhou Cooperative ML-based NIDS between the data and control plane in SDN Master WS2022 Completed
William Laarakkers Scalable and Efficient Processing of Financial Data using CEP Bachelor (external RUG) SS2022 Completed
Caroline Braams Securing publish/subscribe in software-defined networks Master (external RUG) WS2021 Completed

 

Supervised Seminars/Labs/Projects

Students topic Type semester status

Rhivu

Auto-Scaling for Distributed Stream Processing using RL

MMC Lab

WS2324 Completed

Hamna (TU Ilmenau)

Efficient Memory Management for Large Language Model Serving with PagedAttention

DSOS Seminar

WS2324 Completed

Daniel (TU Ilmenau)

gSampler: General and Efficient GPU-based Graph Sampling for Graph Learning

DSOS Seminar

WS2324 Completed
Daniil (TU Ilmenau) Parallel Stream Processing in Distributed Stream Processing Systems

DSOS Seminar

SS2023 Completed
Mouayad Parallelizing Intra-Window Join using Hardware Accelerator MMC seminar SS2023 Completed
Mouaz, Jinyao, Zongdi Parallel Stream Processing and Data Analytics using Flink MMC Lab SS2023 Completed
Chippy, Nishchay Parallel Stream Processing using Reinforcement Learning MMC seminar WS2223 Completed
Mouayad, Ozcan Real-time query validation and performance visualization MMC Lab WS2223 Completed
Sravya, Gayatri, Harsha Big Data Analysis and Performance Evaluation MMC Lab WS2223 Completed
Aastha, Sushimitha Learned Operator Parallelization and Stream Processing MMC seminar SS2022 Completed
Chengbo, Xiangyu, Xingzhou Network Security Monitoring in SDN MMC Lab SS2022 Completed
Anton Performance Evaluation of Windows Join in Stream Processing System MMC Project SS2022 Completed
Sushimitha, Christian Query processing and performance analysis using Flink MMC Lab SS2022 Completed
Marvin Hardware Accelerated Stream Processing MMC seminar WS2021 Completed
Ali Resources Elasticity and Computation Offloading Techniques in Stream Processing Systems MMC seminar WS2021 Completed
corinna Telcaria Alviu: Network Traffic Monitoring and Profiling MMC Lab WS2021 Completed

 

Proseminar ETiT

Proseminar Electrical Engineering and Information Technology
TUCaN course ID: 18-sm-1000-ps
Organization: Pratyush Agnihotri


 

Offene Abschlussarbeiten

Supervisor: Pratyush Agnihotri
KOM-ID: KOM-M-0721
Link zur Ausschreibung

Motivation:

Internet of Things describes a new world of heterogeneous objects such as sensors, smartphones, actuators etc, which intelligently interacts and communicate with each other and offer a better quality of life. With the increase of these objects, the data volume is growing as well. Processing this volume of data can be helpful to various application domains to reveal complex patterns, gain fast insights or react to observed situations, e.g., fraud detection, smart factories, telecommunication, etc. However, it becomes extremely challenging to continuously process this high volume of data in a timely manner and meet these applications' demands. One of the possible ways can be to use heterogeneous resources' capability such as GPUs, FPGAs, etc. to accelerate the big data analytics instead of relying on homogeneous resources, e.g., CPU.

Task:

  • Analyze existing related literature.
  • Formalizing the problem and developing the solution and
  • Setting up development environment and implementing the proof of concept or prototype.
  • Evaluation of the proposed solution in comparison to the existing approaches. .

Prerequisites:

There are no hard guidelines for this topic. However, it is preferable if you have:

  • Good programming skills in Java and/or Scala.
  • Understanding of the concept of .
    • communication networks and .
    • big data processing engines, e.g., Apache Flink, Kafka.

Contact:

I am looking for motivated students interested in working on cutting-edge technologies and IoT applications area. If you are interested in writing a Bachelor's or Master's Thesis or gaining experience in MAKI project as HiWi, please feel free to contact me. You can email me the following information:

  • Your CV or small text about your courses and prior experiences in this area.
  • The short note on your motivation about this topic and,
  • Transcript or TUCaN grade list.
Supervisor: Pratyush Agnihotri
KOM-ID: KOM-M-0720
Link zur Ausschreibung

Motivation

 In Distributed Stream Processing Systems (DSPS), queries are usually long-running. They typically deal with a very high workload of million or even billion of events per second. Under such scenarios, parallelism plays an important role in providing scalability for DSPS. One of the core
decisions here is to determine the right parallelization degree to process a high load of events and queries while ensuring high throughput and low latency requirements. In this context, parallelization can be controlled by defining the degree of parallelism for each operator in the operator graph.
The main questions we want to investigate here are -
 

  •  What is the right parallelism degree for query processing? (what is the performance in terms of end-to-end latency for parallel stream processing? At what extend the model can be used - seen and unseen behaviour? What are the limitations of models? At which can be extended?)
  • How to parallelize the processing of DSPs operators? 

 Prerequisites

 There are no hard guidelines for this topic. However, it is preferable if you have:

  • Good programming skills in Java and Python
  • Good understanding of the concepts of machine learning
  • Understanding of the concept of communication networks, big data processing engines, e.g., Apache Flink, Kafka. 

 Reference Literature

  • Three steps is all you need: fast, accurate, automatic scaling decisions for distributed streaming dataflows.
  • A Comprehensive Survey on Parallelization and Elasticity in Stream Processing
  • Resource Management and Scheduling in Distributed Stream Processing Systems: A Taxonomy, Review, and Future Directions
  • One Model to Rule them All: Towards Zero-Shot Learning for Databases Benjamin
  • Apache Flink documentation: nightlies.apache.org/flink/flink-docs-master/docs/learn-flink/overview/ [https://nightlies.apache.org/flink/flink-docsmaster/docs/learn-flink/overview/] 

Contact:

I am looking for motivated students interested in working on cutting-edge technologies and IoT applications area. If you are interested in writing a Bachelor's or Master's Thesis or gaining experience in MAKI project as HiWi, please feel free to contact me. You can email me the following information:

  • Your CV or small text about your courses and prior experiences in this area.
  • The short note on your motivation about this topic and,
  • Transcript or TUCaN grade list.
Supervisor: Pratyush Agnihotri
KOM-ID: KOM-M-0770
Link zur Ausschreibung

Distributed Stream Processing Systems (DSPS) allow real-time processing of data to fetch meaningful information. Big giants like Twitter, Facebook, and Alibaba rely on DSP for real-time data analytics. For example, click analytics, credit card fraud detection, etc. In DSPS, queries are usually long-running. They typically deal with a very high workload of millions or even billions of events per second. Under such scenarios, parallelism plays an important role in providing scalability for DSPS where either the input stream is partitioned into multiple processes or sub-streams or duplicated to be processed by operators and their instances. However, existing DSPS lacks the benchmarking of the parallel processing capability of operators under various workloads, event rates, and different queries.

Research Goals:

  • Literature study of benchmarking parallel stream processing.
  • Implementing Window and Pane-based parallelization strategy.
  • Evaluating the performance for various benchmarks
  • Displaying the performance evaluation result on the front end.
  • Possible DSPS to be used: Apache Flink

Veröffentlichungen

Alle Veröffentlichungen anzeigen