Welcome to the Laboratory of Algorithms and Data Analysis (LAAD) at the National University of San Antonio Abad del Cusco (UNSAAC)!

At LAAD, located in the Professional School of Computer Engineering and Systems, we are dedicated to the research and application of advanced algorithms and data analysis.

Explore our site to discover exciting projects, events, and opportunities to contribute to the advancement of science and technology.

Join us in exploring new frontiers in computer science and data science at LAAD UNSAAC!

The Data and Algorithms analysis Lab at UNSAAC (in spanish, Laboratorio de Algoritmos y Análisis de Datos - LAAD) is located at the National University of San Antonio Abad del Cusco, Peru as part of the Informatics Professional School.

The members of the laboratory is conformed by both very experienced and also young researchers that work together as a team in a number of ongoing projects in order to solve many local problems with the help of Computer Science. There is a wide range of research areas in which many of our members are currently working on; including Theory of Computation, Data Analysis, Bioinformatics, Combinatorics, etc. (see the Research Areas section).

Although our main activity is related to Computer Science, we encourage anyone who is interested in making scientific or technological production to contact us and join our team to make great things happen!

"Computer science is no more about computers than astronomy is about telescopes."

- Edsger W. Dijkstra

Sources

Undergraduate thesis templates in Latex

Plantilla de plan de tesis (click to see more)

The template is constructed based on the *official guide of the professional School of Computer Science - UNSAAC.
Links:

download template.zip Go to overleaf YouTube video

Plantilla de tesis (click to see more)

The code is constructed based on a template and consulting the book: "Thesis Projects, A Guide for Students in Computer Science and Information Systems by M. Berndtsson and B. Olsson, second edition."
Links:

download template.zip Go to overleaf YouTube video

Projects

Identification of risk areas and mobility patterns in archaeological parks in the Cusco region based on geolocated photos from Flickr.

Description: The present work aims to determine risk areas and mobility patterns in the archaeological parks of the Cusco region. To achieve this objective, we will analyze data associated with photographs shared on the Flickr social network, including the date, time, and precise geolocation of each photo. The process begins with collecting data through the Flickr API, followed by organizing, processing, and storing it in a…

Grant: Universidad Nacional de San Antonio Abad del Cusco

Date: 2018-2022

Total funding amount: PEN 496,554

Development of a virtual assistant to provide information and collect data in the areas of health, education, and economy as a mitigation measure for COVID-19

Description: The lack of a database to make decisions on quick actions during the pandemic caused by COVID-19 showed the need to use new technologies to speed up the process of capturing decentralized information. This article presents a virtual assistant (chatbot) called SaminBot as an alternative to collect data and provide information during the COVID-19 pandemic. Applied in the Cusco region of Peru, this chatbot facilitated conversations on health, economy, and education from January to August 2020. SaminBot begins data collection based on the user's area of interest, obtains demographic information of it and guides you through questions with the intention of providing useful information according to your personal situation. Specialists validated the questionnaires within their respective fields. We began the data collection process in January and concluded it in June 2021. We utilized the WhatsApp and Facebook Messenger platforms as well as the website, resulting in the acquisition of 1586 records.

Grant: Universidad Nacional de San Antonio Abad del Cusco

Date: 2020-2021

Total funding amount: PEN 50,000

Awards & Honors

Here, you'll find our awards listed in reverse chronological order. If you have any questions or would like to discuss a specific publication, feel free to contact us.

2024

Identification of climate change and sustainability texts using pre-trained Spanish language models

GH Robles, G Zuñiga-Rojas.

Best Paper at LatinX in AI workshop.

2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics.

conference paper

2023

Multivariable-Unistep Prediction of Travel Times in Public Transport Buses Using LSTM and Convolutional LSTM

Elizon F. Carcausto-Mamani, Etson R. Rojas-Cahuana, Edwin Alvarez-Mamani & Harley Vera-Olivera.

In SIMBig 2023.

conference paper

2022

I see you: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras

Hanan Quispe, Jorshinno Sumire, Patricia Condori, Edwin Alvarez, Harley Vera.

At LXAI workshop at NeurIPS 2022.

Arxiv - cs

2021

Trajectory anomaly detection based on similarity analysis

GF Quispe-Torres, G Garcia-Zanabria, H Vera-Olivera, L Enciso-Rodas.

Best Paper at 2021 XLVII Latin American Computing Conference (CLEI), 1-10.

IEEExplore

2020

Parallel Social Spider Optimization Algorithms with Island Model for the Clustering Problem

Edwin Alvarez-Mamani, Lauro Enciso-Rodas, Mauricio Ayala-Rincón & José L. Soncco-Álvarez .

At SIMBig 2020

Conference Paper

2019

A multi-objective optimization algorithm for center-based clustering

Jared León, Boris Chullo-Llave, Lauro Enciso-Rodas and José Luis Soncco-Álvarez.

Best Paper At CLEI 2019

Electronic Notes in Theoretical Computer Science

Publications

1. "Social spider algorithm approach for clustering"

Autors: Harley Vera-Olivera, José Luis Soncco-Álvarez, Lauro Enciso-Rodas

Abstract:

Clustering is a popular data analysis technique to identify homogeneous groups of objects based on the values of their attributes, used in many disciplines and applications. This extended abstract of our undergraduate thesis for obtaining the engineer degree in informatics and systems, presents an approach

based on the Social Spider Optimization(SSO) algorithm for optimizing clusters of data, taking as metric the sum of euclidean distances. Other important algorithms of the literature were implemented in order to make comparisons: K-means algorithm, and a Genetic Algorithm (GA) for Clustering. Experiments were performed using 5 datasets taken from the UCI Machine Learning Repository, each algorithm was executed many times and then the following measures were calculated: mean, median, minimum, and maximum values of the results. These experiments showed that the SSO algorithm outperforms the K- means algorithm, and it has results equally competitive as the GA. All these results were confirmed by statistical tests performed over the outputs of the algorithm.

Read less...

2. "Trajectory Anomaly Detection based on Similarity Analysis"

Autors: Gerar F. Quispe-Torres, Germain Garcia-Zanabria, Harley Vera-Olivera & Lauro Enciso-Rodas

Abstract:

Automatic trajectory processing has multiple applications, mainly due to the wide availability of the data. Trajectory data have a significant practical value, making possible the modeling of various problems such as surveillance and tracking devices, detect anomaly trajectories, identifying illegal and

adverse activity. In this study, we show a comparative analysis of the performance of two descriptors to detect anomaly trajectories. We define Wavelet and Fourier transforms as trajectory descriptors to generate characteristics and subsequently detect anomalies. The experiments emphasize performance in the description in the coefficient feature space. For that, we used unsupervised learning, specifically clustering techniques, to generate subsets and identify which are irregular. The implications of the study demonstrate that it is possible to use descriptors in trajectories for automatic anomaly detection and the use of unsupervised learning methods that automatically segment the required information. The performance and comparative analysis of our study are demonstrated through experiments and a case study considering synthetic and real data sets that leave evidence of our contribution.

Read Less...

3. "Clustering Analysis for Traffic Jam Detection for Intelligent Transportation System"

Autors: Harley Vera-Olivera, José Luis Soncco-Álvarez and Edwin Alvarez-Mamani

Abstract:

The growth of cities and the mobility of their inhabitants often generate traffic jams. In order to diminish this problem, cities began to implement Intelligent Transportation Systems (ITS), such as real-time control and monitoring of public transportation buses, smart traffic lights, mobile

applications to inform passengers. ITS are a reliable source of data collection for further analysis. This research aims to perform a clustering analysis to detect traffic jams, for which the Parallel Social Spider Optimization (P-SSO) algorithm was used. The choice of the P-SSO algorithm was made based on the previous results where the P-SSO algorithm was compared with the K-means and Social Spider Optimization (SSO) algorithms to solve clustering problems. On the first stage of this research, the P-SSO algorithm was used to define traffic jam states. In the second stage, the P-SSO algorithm was used to detect traffic jam areas based on the geospatial data generated by public transport buses in Cusco-Perú. This clustering analysis was performed on the main streets of the city between June and August 2019. Further, an application was developed that allows to check graphically the clusters obtained.

Read Less...

link to pdf or webpage

4. "I see you: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras"

Autors: Hanan Quispe, Jorshinno Sumire, Patricia Condori, Edwin Alvarez, Harley Vera

Abstract:

The development of autonomous vehicles arises new challenges in urban traffic scenarios where vehicle-pedestrian interactions are frequent e.g. vehicle yields to pedestrians, pedestrian slows down due approaching to the vehicle. Over the last years, several datasets have been developed to model these

interactions. However, available datasets do not cover near-accident scenarios that our dataset covers. We introduce I see you, a new vehicle-pedestrian interaction dataset that tackles the lack of trajectory data in near-accident scenarios using YOLOv5 and camera calibration methods. I see you consist of 170 near-accident occurrences in seven intersections in Cusco-Peru. This new dataset and pipeline code are available on GitHub.

Read Less...

link to pdf or webpage

5. "Feature-based Trajectory Anomaly Detection"

Autors: Gerar F. Quispe-Torres, Lauro Enciso-Rodas, Germain Garcia-Zanabria, Harley Vera

Abstract:

The high availability of trajectory data in different fields makes it attractive to analyze and enhance its multiple practical applications. In particular, trajectory anomaly detection has a significant practical value, making it possible identifying trajectories that may indicate illegal and adverse activity

in diverse areas such as surveillance, tracking devices, traffic, and people flow. This study presents a methodology to detect anomaly trajectories based on their morphology features. For that, we follow two stages: (1) comparative analysis of the performance of two descriptors to group similar trajectories, and (2) trajectory anomaly detection based on their similarities. We define Wavelet and Fourier transforms as trajectory descriptors to generate characteristics and subsequently detect anomalies. Our experiments emphasize the measure of the performance in the description of the coefficient feature space using unsupervised learning, specifically clustering techniques, to create subsets and identify irregular ones. The study's implications demonstrate that it is possible to use descriptors in trajectories for automatic anomaly detection and the use of unsupervised learning to segment required information. Our study's performance and comparative analysis have been demonstrated throughout multiple experiments. We present some quantitative results using synthetic data sets as well as qualitative analysis throughout two case studies considering real data sets that leave evidence of our contribution.

Read Less...

link to pdf or webpage

6. "Towards accurate building recognition using convolutional neural networks"

Autors: Jeanfranco D. Farfan-Escobedo, Lauro Enciso-Rodas, John E. Vargas-Muñoz

Abstract:

Building recognition from images is a challenging task since pictures can be taken from different angles and under different illumination conditions. Most of the building recognition methods use local and global handcrafted image features and do not consider the rejection scenario, where the method have to

be capable of identifying if a given image does not belong to any of the classes of interest. We propose a method based on convolutional neural networks that obtain effective feature vectors to perform accurate classification of buildings. Additionally, we analyze and propose methods for the problem of classification with rejection.

Read Less...

link to pdf or webpage

7. "Software as a service google apps in the internal communication of the national university of san antonio abad del cusco"

AUTORS: Hernán Nina, Lauro Enciso

ABSTRACT:

The internal communication in public universities is an unexplored field, however the previous investigations considered to internal communication as a strategic element and essential to the success of an organization. You can improve internal communication using digital channels that enhance and

simplify the variety of traditional channels. The objective of the research is determined what extent applications in the cloud of Google Apps for Education improve internal communication between members of the university community. The results of the research reveal that the use of the Software as a Service Google Apps does not substantially improve internal communication among members of the university community. However, we have identified some factors that prevent the use of Google Apps for Education, such as limitations in computer infrastructure, low level of knowledge in computing, limited use of digital tools in academic activities and the absence of a strategic plan for internal communication.

Read Less...

link to pdf or webpage

8. "Application of convolutional neural networks for static hand gestures recognition under different invariant features"

Autors: C. Jose L. Flores, A. E. Gladys Cutipa, R. Lauro Enciso

Abstract:

The present work proposes to recognize the static hand gestures taken under invariations features as scale, rotation, translation, illumination, noise and background. We use the alphabet of sign language of Peru (LSP). For this purpose, digital image processing techniques are used to eliminate or reduce

noise, to improve the contrast under a variant illumination, to separate the hand from the background of the image and finally detect and cut the region containing the hand gesture. We use of convolutional neural networks (CNN) to classify the 24 hand gestures. Two CNN architectures were developed with different amounts of layers and parameters per layer. The tests showed that the first CNN has an accuracy of 95.37% and the second CNN has an accuracy of 96.20% in terms of recognition of the 24 static hand gestures using the database developed. We compared the two architectures developed in accuracy level for each type of invariance presented in this paper. We compared the two architectures developed and usual techniques of machine learning in results of accuracy.

Read Less...

link to pdf or webpage

9. "Peruvian Sign Language Recognition Using a Hybrid Deep Neural Network"

Autors: Yuri Vladimir Huallpa Vargas, Naysha Naydu Diaz Ccasa & Lauro Enciso Rodas

Abstract:

Hearing impaired people have the ability to communicate with their hands and interpret sign language (SL), but this builds a communication gap with normal people. There are models for SL recognition that have images sequence RGB as input; however, the movements of the body in 3D space is

necessary to consider due to the complexity of the gestures. We built a model for Peruvian sign language recognition (PSL) to Spanish composed by 4 phases; first, the preprocessing phase in charge to process RGB, depth and skeleton streams obtained through the Kinect sensor v.1; second, the feature extraction which learn spatial information through 3 types of convolutional neural network (CNN); third, the bidirectional long short term memory (BLSTM) with residual connections in charge to reduced and encode the information. Finally, a decoder with attention mechanism and maxout network which learn the temporal information. Our proposed model is evaluated in LSA64 and ourself-built dataset. The experimental results show significant improvement compared to other models evaluated in these dataset .

Read Less...

link to pdf or webpage

10. "Multi-Objective Optimization Algorithm for Center-Based Clustering"

Autors: Jared Leon, Boris Chullo-Llave, Lauro Enciso-Rodas, Jose Luis Soncco-Alvarez

Abstract:

Center-based clustering is a set of clustering problems that require finding a single element, a center, to represent an entire cluster. The algorithms that solve this type of problems are very efficient for clustering large and high-dimensional datasets. In this paper, we propose a similar heuristic used in

Lloyd’s algorithm to approximately solve (EMAX algorithm) a more robust variation of the k-means problem, namely the EMAX problem. Also, a new center-based clustering algorithm (SSO-C) is proposed, which is based on a swarm intelligence technique called Social Spider Optimization. This algorithm minimizes a multi-objective optimization function defined as a weighted combination of the objective functions of the k-means and EMAX problems. Also, an approximation algorithm for the discrete k-center problem is used as a local search strategy for initializing the population. Results of the experiments showed that SSO-C algorithm is suitable for finding maximum best values, however EMAX algorithm is better in finding median and mean values.

Read Less...

link to pdf or webpage

11. "Parallel Social Spider Optimization Algorithms with Island Model for the Clustering Problem"

Autors: Edwin Alvarez-Mamani, Lauro Enciso,Mauricio Ayala-Rincon, José Luis Soncco-Álvarez

Abstract:

The digital age came with an extraordinary ability to generate data across organizations, people, and devices, data that needs to be analyzed, processed and stored. A well-known technique for analyzing this kind of data is Clustering. Many bio-inspired algorithms were proposed for this problem such as the

Social Spider Optimization (SSO). In this work, we propose parallel island models of the SSO algorithm for the Clustering problem, using 24 processors for each parallel algorithm. Such models were implemented using static and dynamic topologies, and datasets from the UCI Machine Learning Repository used for the stage of experiments. The achieved average speedups range from 15 to 28 times faster than the SSO algorithm for large and small datasets, respectively, and a parallel model with static ring topology performs a little bit faster than the other parallel models. The parallel algorithms provide results with similar precision to the ones computed with the SSO algorithm.

Read Less...

link to pdf

Members

Professors

Dr. Lauro Enciso Rodas
Algorithms

.
Dr. Hans Harley Ccacyahuillca Bejar
Computational Vision

.
Dr. Harley Vera Olivera
Databases,Data model

CTI Vitae
Mag. Gabriela Zuniga Rojas
Natural Language Processing (NLP)

.

Mag. Gerar Francis Quispe Torres
Procesamiento de Datos, Detección de Anomálias

CTI Vitae
Mag. Edwin Alvarez
Machine Learning, Parallel Computer

CTI Vitae
Dr. Fernando Montoya Cubas
Redes Complejas, Redes de Interacción Genética

.
nombre
Especialidad

.

Students

Ibeth Janela del Pilar Cusi

.
Johan Mihail Conde Sallo

.
Yasumy Maricely Jallo Paccaya
Especialidad

.
Thalia Quispe Mamani
Especialidad

.
Jadyra Ch'aska Choque Quispe
Especialidad

.
Rebeca Araceli Ccansaya Soncco
Especialidad

.

Ex Students

Mag. Edwin Alvarez Mamani
Ph. D student at PUCP, Perú

CTI Vitae
Mag. Jared León Malpartida
Ph. D student at Warwick, England

.
bach. Milagros Yarahuaman Rojas
MSc student at USP, Brasil

.
nombre
Especialidad

.
nombre
Especialidad

.
nombre
Especialidad

.

Sources

Undergraduate thesis templates in Latex

Plantilla de plan de tesis (click to see more)

Plantilla de tesis (click to see more)

Projects

Identification of risk areas and mobility patterns in archaeological parks in the Cusco region based on geolocated photos from Flickr.

Development of a virtual assistant to provide information and collect data in the areas of health, education, and economy as a mitigation measure for COVID-19

Awards & Honors

2024

Identification of climate change and sustainability texts using pre-trained Spanish language models

2023

Multivariable-Unistep Prediction of Travel Times in Public Transport Buses Using LSTM and Convolutional LSTM

2022

I see you: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras

2021

Trajectory anomaly detection based on similarity analysis

2020

Parallel Social Spider Optimization Algorithms with Island Model for the Clustering Problem

2019

A multi-objective optimization algorithm for center-based clustering

Publications

1. "Social spider algorithm approach for clustering"

2. "Trajectory Anomaly Detection based on Similarity Analysis"

3. "Clustering Analysis for Traffic Jam Detection for Intelligent Transportation System"

4. "I see you: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras"

5. "Feature-based Trajectory Anomaly Detection"

6. "Towards accurate building recognition using convolutional neural networks"

7. "Software as a service google apps in the internal communication of the national university of san antonio abad del cusco"

8. "Application of convolutional neural networks for static hand gestures recognition under different invariant features"

9. "Peruvian Sign Language Recognition Using a Hybrid Deep Neural Network"

10. "Multi-Objective Optimization Algorithm for Center-Based Clustering"

11. "Parallel Social Spider Optimization Algorithms with Island Model for the Clustering Problem"

Members

Professors

Dr. Lauro Enciso Rodas

Dr. Hans Harley Ccacyahuillca Bejar

Dr. Harley Vera Olivera

Mag. Gabriela Zuniga Rojas

Mag. Gerar Francis Quispe Torres

Mag. Edwin Alvarez

Dr. Fernando Montoya Cubas

nombre

Students

Ibeth Janela del Pilar Cusi

Johan Mihail Conde Sallo

Yasumy Maricely Jallo Paccaya

Thalia Quispe Mamani

Jadyra Ch'aska Choque Quispe

Rebeca Araceli Ccansaya Soncco

Ex Students

Mag. Edwin Alvarez Mamani

Mag. Jared León Malpartida

bach. Milagros Yarahuaman Rojas

nombre

nombre

nombre

Contact Us

Email: