LAAD

Welcome to the Laboratory of Algorithms and Data Analysis (LAAD) at the National University of San Antonio Abad del Cusco (UNSAAC)!

At LAAD, located in the Professional School of Computer Engineering and Systems, we are dedicated to the research and application of advanced algorithms and data analysis.

Explore our site to discover exciting projects, events, and opportunities to contribute to the advancement of science and technology.

Join us in exploring new frontiers in computer science and data science at LAAD UNSAAC!

Logo

The Data and Algorithms analysis Lab at UNSAAC (in spanish, Laboratorio de Algoritmos y Análisis de Datos - LAAD) is located at the National University of San Antonio Abad del Cusco, Peru as part of the Informatics Professional School.

The members of the laboratory is conformed by both very experienced and also young researchers that work together as a team in a number of ongoing projects in order to solve many local problems with the help of Computer Science. There is a wide range of research areas in which many of our members are currently working on; including Theory of Computation, Data Analysis, Bioinformatics, Combinatorics, etc. (see the Research Areas section).

Although our main activity is related to Computer Science, we encourage anyone who is interested in making scientific or technological production to contact us and join our team to make great things happen!

"Computer science is no more about computers than astronomy is about telescopes."

- Edsger W. Dijkstra

Sources

Undergraduate thesis templates in Latex

Projects

Identification of risk areas and mobility patterns in archaeological parks in the Cusco region based on geolocated photos from Flickr.

Description: The present work aims to determine risk areas and mobility patterns in the archaeological parks of the Cusco region. To achieve this objective, we will analyze data associated with photographs shared on the Flickr social network, including the date, time, and precise geolocation of each photo. The process begins with collecting data through the Flickr API, followed by organizing, processing, and storing it in a…

Grant: Universidad Nacional de San Antonio Abad del Cusco

Date: 2018-2022

Total funding amount: PEN 496,554

Development of a virtual assistant to provide information and collect data in the areas of health, education, and economy as a mitigation measure for COVID-19

Description: The lack of a database to make decisions on quick actions during the pandemic caused by COVID-19 showed the need to use new technologies to speed up the process of capturing decentralized information. This article presents a virtual assistant (chatbot) called SaminBot as an alternative to collect data and provide information during the COVID-19 pandemic. Applied in the Cusco region of Peru, this chatbot facilitated conversations on health, economy, and education from January to August 2020. SaminBot begins data collection based on the user's area of interest, obtains demographic information of it and guides you through questions with the intention of providing useful information according to your personal situation. Specialists validated the questionnaires within their respective fields. We began the data collection process in January and concluded it in June 2021. We utilized the WhatsApp and Facebook Messenger platforms as well as the website, resulting in the acquisition of 1586 records.

Grant: Universidad Nacional de San Antonio Abad del Cusco

Date: 2020-2021

Total funding amount: PEN 50,000

Awards & Honors

Here, you'll find our awards listed in reverse chronological order. If you have any questions or would like to discuss a specific publication, feel free to contact us.


2024

Identification of climate change and sustainability texts using pre-trained Spanish language models

GH Robles, G Zuñiga-Rojas.

Best Paper at LatinX in AI workshop.

2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics.

conference paper

NAACL

2023

Multivariable-Unistep Prediction of Travel Times in Public Transport Buses Using LSTM and Convolutional LSTM

Elizon F. Carcausto-Mamani, Etson R. Rojas-Cahuana, Edwin Alvarez-Mamani & Harley Vera-Olivera.

In SIMBig 2023.

conference paper

Transit Network Analysis

2022

I see you: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras

Hanan Quispe, Jorshinno Sumire, Patricia Condori, Edwin Alvarez, Harley Vera.

At LXAI workshop at NeurIPS 2022.

Arxiv - cs

LXAI

2021

Trajectory anomaly detection based on similarity analysis

GF Quispe-Torres, G Garcia-Zanabria, H Vera-Olivera, L Enciso-Rodas.

Best Paper at 2021 XLVII Latin American Computing Conference (CLEI), 1-10.

IEEExplore

Transit Network Analysis

2020

Parallel Social Spider Optimization Algorithms with Island Model for the Clustering Problem

Edwin Alvarez-Mamani, Lauro Enciso-Rodas, Mauricio Ayala-Rincón & José L. Soncco-Álvarez .

At SIMBig 2020

Conference Paper

SIMBig 2020

2019

A multi-objective optimization algorithm for center-based clustering

Jared León, Boris Chullo-Llave, Lauro Enciso-Rodas and José Luis Soncco-Álvarez.

Best Paper At CLEI 2019

Electronic Notes in Theoretical Computer Science

CLEI 2019

Publications

1. "Social spider algorithm approach for clustering"

Autors: Harley Vera-Olivera, José Luis Soncco-Álvarez, Lauro Enciso-Rodas

Abstract:

Clustering is a popular data analysis technique to identify homogeneous groups of objects based on the values of their attributes, used in many disciplines and applications. This extended abstract of our undergraduate thesis for obtaining the engineer degree in informatics and systems, presents an approach

based on the Social Spider Optimization(SSO) algorithm for optimizing clusters of data, taking as metric the sum of euclidean distances. Other important algorithms of the literature were implemented in order to make comparisons: K-means algorithm, and a Genetic Algorithm (GA) for Clustering. Experiments were performed using 5 datasets taken from the UCI Machine Learning Repository, each algorithm was executed many times and then the following measures were calculated: mean, median, minimum, and maximum values of the results. These experiments showed that the SSO algorithm outperforms the K- means algorithm, and it has results equally competitive as the GA. All these results were confirmed by statistical tests performed over the outputs of the algorithm.

link to pdf or webpage

2. "Trajectory Anomaly Detection based on Similarity Analysis"

Autors: Gerar F. Quispe-Torres, Germain Garcia-Zanabria, Harley Vera-Olivera & Lauro Enciso-Rodas

Abstract:

Automatic trajectory processing has multiple applications, mainly due to the wide availability of the data. Trajectory data have a significant practical value, making possible the modeling of various problems such as surveillance and tracking devices, detect anomaly trajectories, identifying illegal and

adverse activity. In this study, we show a comparative analysis of the performance of two descriptors to detect anomaly trajectories. We define Wavelet and Fourier transforms as trajectory descriptors to generate characteristics and subsequently detect anomalies. The experiments emphasize performance in the description in the coefficient feature space. For that, we used unsupervised learning, specifically clustering techniques, to generate subsets and identify which are irregular. The implications of the study demonstrate that it is possible to use descriptors in trajectories for automatic anomaly detection and the use of unsupervised learning methods that automatically segment the required information. The performance and comparative analysis of our study are demonstrated through experiments and a case study considering synthetic and real data sets that leave evidence of our contribution.

link to pdf or webpage

3. "Clustering Analysis for Traffic Jam Detection for Intelligent Transportation System"

Autors: Harley Vera-Olivera, José Luis Soncco-Álvarez and Edwin Alvarez-Mamani

Abstract:

The growth of cities and the mobility of their inhabitants often generate traffic jams. In order to diminish this problem, cities began to implement Intelligent Transportation Systems (ITS), such as real-time control and monitoring of public transportation buses, smart traffic lights, mobile

applications to inform passengers. ITS are a reliable source of data collection for further analysis. This research aims to perform a clustering analysis to detect traffic jams, for which the Parallel Social Spider Optimization (P-SSO) algorithm was used. The choice of the P-SSO algorithm was made based on the previous results where the P-SSO algorithm was compared with the K-means and Social Spider Optimization (SSO) algorithms to solve clustering problems. On the first stage of this research, the P-SSO algorithm was used to define traffic jam states. In the second stage, the P-SSO algorithm was used to detect traffic jam areas based on the geospatial data generated by public transport buses in Cusco-Perú. This clustering analysis was performed on the main streets of the city between June and August 2019. Further, an application was developed that allows to check graphically the clusters obtained.

link to pdf or webpage

4. "I see you: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras"

Autors: Hanan Quispe, Jorshinno Sumire, Patricia Condori, Edwin Alvarez, Harley Vera

Abstract:

The development of autonomous vehicles arises new challenges in urban traffic scenarios where vehicle-pedestrian interactions are frequent e.g. vehicle yields to pedestrians, pedestrian slows down due approaching to the vehicle. Over the last years, several datasets have been developed to model these

interactions. However, available datasets do not cover near-accident scenarios that our dataset covers. We introduce I see you, a new vehicle-pedestrian interaction dataset that tackles the lack of trajectory data in near-accident scenarios using YOLOv5 and camera calibration methods. I see you consist of 170 near-accident occurrences in seven intersections in Cusco-Peru. This new dataset and pipeline code are available on GitHub.

link to pdf or webpage

5. "Feature-based Trajectory Anomaly Detection"

Autors: Gerar F. Quispe-Torres, Lauro Enciso-Rodas, Germain Garcia-Zanabria, Harley Vera

Abstract:

The high availability of trajectory data in different fields makes it attractive to analyze and enhance its multiple practical applications. In particular, trajectory anomaly detection has a significant practical value, making it possible identifying trajectories that may indicate illegal and adverse activity

in diverse areas such as surveillance, tracking devices, traffic, and people flow. This study presents a methodology to detect anomaly trajectories based on their morphology features. For that, we follow two stages: (1) comparative analysis of the performance of two descriptors to group similar trajectories, and (2) trajectory anomaly detection based on their similarities. We define Wavelet and Fourier transforms as trajectory descriptors to generate characteristics and subsequently detect anomalies. Our experiments emphasize the measure of the performance in the description of the coefficient feature space using unsupervised learning, specifically clustering techniques, to create subsets and identify irregular ones. The study's implications demonstrate that it is possible to use descriptors in trajectories for automatic anomaly detection and the use of unsupervised learning to segment required information. Our study's performance and comparative analysis have been demonstrated throughout multiple experiments. We present some quantitative results using synthetic data sets as well as qualitative analysis throughout two case studies considering real data sets that leave evidence of our contribution.

link to pdf or webpage

6. "Towards accurate building recognition using convolutional neural networks"

Autors: Jeanfranco D. Farfan-Escobedo, Lauro Enciso-Rodas, John E. Vargas-Muñoz

Abstract:

Building recognition from images is a challenging task since pictures can be taken from different angles and under different illumination conditions. Most of the building recognition methods use local and global handcrafted image features and do not consider the rejection scenario, where the method have to

be capable of identifying if a given image does not belong to any of the classes of interest. We propose a method based on convolutional neural networks that obtain effective feature vectors to perform accurate classification of buildings. Additionally, we analyze and propose methods for the problem of classification with rejection.

link to pdf or webpage

7. "Software as a service google apps in the internal communication of the national university of san antonio abad del cusco"

AUTORS: Hernán Nina, Lauro Enciso

ABSTRACT:

The internal communication in public universities is an unexplored field, however the previous investigations considered to internal communication as a strategic element and essential to the success of an organization. You can improve internal communication using digital channels that enhance and

simplify the variety of traditional channels. The objective of the research is determined what extent applications in the cloud of Google Apps for Education improve internal communication between members of the university community. The results of the research reveal that the use of the Software as a Service Google Apps does not substantially improve internal communication among members of the university community. However, we have identified some factors that prevent the use of Google Apps for Education, such as limitations in computer infrastructure, low level of knowledge in computing, limited use of digital tools in academic activities and the absence of a strategic plan for internal communication.

link to pdf or webpage

8. "Application of convolutional neural networks for static hand gestures recognition under different invariant features"

Autors: C. Jose L. Flores, A. E. Gladys Cutipa, R. Lauro Enciso

Abstract:

The present work proposes to recognize the static hand gestures taken under invariations features as scale, rotation, translation, illumination, noise and background. We use the alphabet of sign language of Peru (LSP). For this purpose, digital image processing techniques are used to eliminate or reduce

noise, to improve the contrast under a variant illumination, to separate the hand from the background of the image and finally detect and cut the region containing the hand gesture. We use of convolutional neural networks (CNN) to classify the 24 hand gestures. Two CNN architectures were developed with different amounts of layers and parameters per layer. The tests showed that the first CNN has an accuracy of 95.37% and the second CNN has an accuracy of 96.20% in terms of recognition of the 24 static hand gestures using the database developed. We compared the two architectures developed in accuracy level for each type of invariance presented in this paper. We compared the two architectures developed and usual techniques of machine learning in results of accuracy.

link to pdf or webpage

9. "Peruvian Sign Language Recognition Using a Hybrid Deep Neural Network"

Autors: Yuri Vladimir Huallpa Vargas, Naysha Naydu Diaz Ccasa & Lauro Enciso Rodas

Abstract:

Hearing impaired people have the ability to communicate with their hands and interpret sign language (SL), but this builds a communication gap with normal people. There are models for SL recognition that have images sequence RGB as input; however, the movements of the body in 3D space is

necessary to consider due to the complexity of the gestures. We built a model for Peruvian sign language recognition (PSL) to Spanish composed by 4 phases; first, the preprocessing phase in charge to process RGB, depth and skeleton streams obtained through the Kinect sensor v.1; second, the feature extraction which learn spatial information through 3 types of convolutional neural network (CNN); third, the bidirectional long short term memory (BLSTM) with residual connections in charge to reduced and encode the information. Finally, a decoder with attention mechanism and maxout network which learn the temporal information. Our proposed model is evaluated in LSA64 and ourself-built dataset. The experimental results show significant improvement compared to other models evaluated in these dataset .

link to pdf or webpage

10. "Multi-Objective Optimization Algorithm for Center-Based Clustering"

Autors: Jared Leon, Boris Chullo-Llave, Lauro Enciso-Rodas, Jose Luis Soncco-Alvarez

Abstract:

Center-based clustering is a set of clustering problems that require finding a single element, a center, to represent an entire cluster. The algorithms that solve this type of problems are very efficient for clustering large and high-dimensional datasets. In this paper, we propose a similar heuristic used in

Lloyd’s algorithm to approximately solve (EMAX algorithm) a more robust variation of the k-means problem, namely the EMAX problem. Also, a new center-based clustering algorithm (SSO-C) is proposed, which is based on a swarm intelligence technique called Social Spider Optimization. This algorithm minimizes a multi-objective optimization function defined as a weighted combination of the objective functions of the k-means and EMAX problems. Also, an approximation algorithm for the discrete k-center problem is used as a local search strategy for initializing the population. Results of the experiments showed that SSO-C algorithm is suitable for finding maximum best values, however EMAX algorithm is better in finding median and mean values.

link to pdf or webpage

11. "Parallel Social Spider Optimization Algorithms with Island Model for the Clustering Problem"

Autors: Edwin Alvarez-Mamani, Lauro Enciso,Mauricio Ayala-Rincon, José Luis Soncco-Álvarez

Abstract:

The digital age came with an extraordinary ability to generate data across organizations, people, and devices, data that needs to be analyzed, processed and stored. A well-known technique for analyzing this kind of data is Clustering. Many bio-inspired algorithms were proposed for this problem such as the

Social Spider Optimization (SSO). In this work, we propose parallel island models of the SSO algorithm for the Clustering problem, using 24 processors for each parallel algorithm. Such models were implemented using static and dynamic topologies, and datasets from the UCI Machine Learning Repository used for the stage of experiments. The achieved average speedups range from 15 to 28 times faster than the SSO algorithm for large and small datasets, respectively, and a parallel model with static ring topology performs a little bit faster than the other parallel models. The parallel algorithms provide results with similar precision to the ones computed with the SSO algorithm.

link to pdf

Members

Professors

Students

Ex Students

Contact Us

FACULTAD DE INGENIERÍA ELÉCTRICA, ELECTRÓNICA, INFORMÁTICA Y MECÁNICA


Escuela Profesional de Ingeniería Informática y de Sistemas, 4th floor Perayoc Campus,
Av. De La Cultura N° 733
Cusco, Cusco, Peru

Feel free to write us!

Email:



laad@unsaac.edu.pe