About
25 years of diverse Engineering experience solving all kinds of problems with diverse…
Articles by Pedro
Activity
-
We’re thrilled to announce a partnership with Redwood Materials to deploy a pioneering battery energy storage system at our Normal, Illinois…
We’re thrilled to announce a partnership with Redwood Materials to deploy a pioneering battery energy storage system at our Normal, Illinois…
Liked by Pedro Larroy
-
A new fleet of Georgia-made robotaxis, featuring familiar, oblong headlights, is expected to hit streets in coming years. Uber plans to invest up to…
A new fleet of Georgia-made robotaxis, featuring familiar, oblong headlights, is expected to hit streets in coming years. Uber plans to invest up to…
Liked by Pedro Larroy
Experience
Education
-
Universitat Politècnica de Catalunya
Ingierniero superior de telecomunicación
-
Activities and Societies: Robotics and embedded software course during vacation. Printed circuit design and microchip PIC programming, spare time activities. Open source projects.
Plan 92: http://www.etsetb.upc.edu/info_sobre/estudis/pla_92/eng_telecos/
-
-
-
-
-
-
Publications
-
Fairness Measures for Machine Learning in Finance
PMR
See publicationThe authors present a machine learning pipeline for fairness-aware machine learning (FAML) in finance that encompasses metrics for fairness (and accuracy). Whereas accuracy metrics are well understood and the principal ones are used frequently, there is no consensus as to which of several available measures for fairness should be used in a generic manner in the financial services industry. The authors explore these measures and discuss which ones to focus on at various stages in the ML…
The authors present a machine learning pipeline for fairness-aware machine learning (FAML) in finance that encompasses metrics for fairness (and accuracy). Whereas accuracy metrics are well understood and the principal ones are used frequently, there is no consensus as to which of several available measures for fairness should be used in a generic manner in the financial services industry. The authors explore these measures and discuss which ones to focus on at various stages in the ML pipeline, pre-training and post-training, and they examine simple bias mitigation approaches. Using a standard dataset, they show that the sequencing in their FAML pipeline offers a cogent approach to arriving at a fair and accurate ML model. The authors discuss the intersection of bias metrics with legal considerations in the United States, and the entanglement of explainability and fairness is exemplified in the case study. They discuss possible approaches for training ML models while satisfying constraints imposed from various fairness metrics and the role of causality in assessing fairness.
-
Amazon SageMaker Clarify: Machine Learning Bias Detection and Explainability in the Cloud
ACM
See publicationUnderstanding the predictions made by machine learning (ML) models and their potential biases remains a challenging and labor-intensive task that depends on the application, the dataset, and the specific model. We present Amazon SageMaker Clarify, an explainability feature for Amazon SageMaker that launched in December 2020, providing insights into data and ML models by identifying biases and explaining predictions. It is deeply integrated into Amazon SageMaker, a fully managed service that…
Understanding the predictions made by machine learning (ML) models and their potential biases remains a challenging and labor-intensive task that depends on the application, the dataset, and the specific model. We present Amazon SageMaker Clarify, an explainability feature for Amazon SageMaker that launched in December 2020, providing insights into data and ML models by identifying biases and explaining predictions. It is deeply integrated into Amazon SageMaker, a fully managed service that enables data scientists and developers to build, train, and deploy ML models at any scale. Clarify supports bias detection and feature importance computation across the ML lifecycle, during data preparation, model evaluation, and post-deployment monitoring. We outline the desiderata derived from customer input, the modular architecture, and the methodology for bias and explanation computations. Further, we describe the technical challenges encountered and the tradeoffs we had to make. For illustration, we discuss two customer use cases. We present our deployment results including qualitative customer feedback and a quantitative evaluation. Finally, we summarize lessons learned, and discuss best practices for the successful adoption of fairness and explanation tools in practice.
-
Fairness Measures for Machine Learning in Finance
AWS
See publicationWe present a machine learning pipeline for
fairness-aware machine learning (FAML) in finance that encompasses metrics for fairness (and
accuracy). Whereas accuracy metrics are well understood and the principal ones used frequently,
there is no consensus as to which of several available measures for fairness should be used in a
generic manner in the financial services industry. We explore these measures and discuss which
ones to focus on, at various stages in the ML
pipeline…We present a machine learning pipeline for
fairness-aware machine learning (FAML) in finance that encompasses metrics for fairness (and
accuracy). Whereas accuracy metrics are well understood and the principal ones used frequently,
there is no consensus as to which of several available measures for fairness should be used in a
generic manner in the financial services industry. We explore these measures and discuss which
ones to focus on, at various stages in the ML
pipeline, pre-training and post-training, and we
also examine simple bias mitigation approaches.
Using a standard dataset we show that the sequencing in our FAML pipeline offers a cogent
approach to arriving at a fair and accurate ML
model. We discuss the intersection of bias metrics with legal considerations in the US, and the
entanglement of explainability and fairness is exemplified in the case study. We discuss possible
approaches for training ML models while satisfying constraints imposed from various fairness
metrics, and the role of causality in assessing fairness. -
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
Arxiv
See publicationWe introduce AutoGluon-Tabular, an open-source AutoML framework that requires only a single line of Python to train highly accurate machine learning models on an unprocessed tabular dataset such as a CSV file. Unlike existing AutoML frameworks that primarily focus on model/hyperparameter selection, AutoGluon-Tabular succeeds by ensembling multiple models and stacking them in multiple layers. Experiments reveal that our multi-layer combination of many models offers better use of allocated…
We introduce AutoGluon-Tabular, an open-source AutoML framework that requires only a single line of Python to train highly accurate machine learning models on an unprocessed tabular dataset such as a CSV file. Unlike existing AutoML frameworks that primarily focus on model/hyperparameter selection, AutoGluon-Tabular succeeds by ensembling multiple models and stacking them in multiple layers. Experiments reveal that our multi-layer combination of many models offers better use of allocated training time than seeking out the best. A second contribution is an extensive evaluation of public and commercial AutoML platforms including TPOT, H2O, AutoWEKA, auto-sklearn, AutoGluon, and Google AutoML Tables. Tests on a suite of 50 classification and regression tasks from Kaggle and the OpenML AutoML Benchmark reveal that AutoGluon is faster, more robust, and much more accurate. We find that AutoGluon often even outperforms the best-in-hindsight combination of all of its competitors. In two popular Kaggle competitions, AutoGluon beat 99% of the participating data scientists after merely 4h of training on the raw data.
-
Peer to peer synchronization using vector clocks and repository update clocks
github
See publicationA method is proposed to detect concurrent changes, conflicts and causality violations in a large set of data files or key-value pairs which are shared and synchronized across a cluster of compute nodes in which the wall clock is not necessarily synchronized. The method proposed also guarantees that on reconnection only the list of files or keys that have changed since the last synchronization is transmitted.
-
GMM Based multimodal biometric identification
http://www.enterface.net/enterface05/docs/results/reports/project5.pdf
Gaussian Mixture Model expectation maximization model for sensor fusion.
Other authors -
Patents
-
Map data compatibility processing architecture
Filed US US20180239828A1
Systems and methods are provided for executing a filter on map data. The filter receives a first notification that a version of first map data from a first map data source is available. The filter determines that the version of first map data is compatible using one or more version rules stored in the filter. The filter processes the version of first map data, when the version of first map data is compatible. The filter generates a second notification that a processed version of first map data…
Systems and methods are provided for executing a filter on map data. The filter receives a first notification that a version of first map data from a first map data source is available. The filter determines that the version of first map data is compatible using one or more version rules stored in the filter. The filter processes the version of first map data, when the version of first map data is compatible. The filter generates a second notification that a processed version of first map data is available.
Other inventorsSee patent -
Fresh hybrid routing independent of map version and provider
Issued US US9874451B2
See patentSystems, methods, and apparatuses are described for providing fresh hybrid routing independent of map version and provider. A set of routing data is received in response to a routing request. The set of routing data includes road segments. An analysis may be performed of a local map and the set of routing data. At least one unmatched road segment between the local map and the set of routing data is identified based on the analysis. A request for update data for the at least one unmatched road…
Systems, methods, and apparatuses are described for providing fresh hybrid routing independent of map version and provider. A set of routing data is received in response to a routing request. The set of routing data includes road segments. An analysis may be performed of a local map and the set of routing data. At least one unmatched road segment between the local map and the set of routing data is identified based on the analysis. A request for update data for the at least one unmatched road segment is made. Using the local map, the set of routing data, and the update data for the at least one unmatched road segment a navigation action is generated.
-
Method for identifying and diagnosing interferences in RF signals and particularly television signals
Filed EU EP2048801
See patentA method for identifying and diagnosing interferences in television signals, whether analogue or digital, in which, for each frequency, frequency interval (1, 2, 3, 4) or carrier wave (P1 to P9,...), there is a predetermined optimal level of a spectrum control variable (D, A), based on time (spectrum), such as for example, intensity, radiated power, MER, BER, etc.
Courses
-
Algorithms I (Coursera - Princeton)
-
-
Algorithms II (Coursera - Princeton)
-
-
Architecture of operating systems
11512
-
Artificial Intelligence (Udacity)
-
-
Bases de dades
M2009
-
Computational Finance
COMP510
-
Computational Investing, Part I
-
-
Concurrent programming
11520
-
Data mining
UOC
-
Data transmission, cryptography and cryptology
11557
-
Functional Programming Principles in Scala (Coursera)
-
-
Game theory (coursera)
-
-
Machine Learning (Andrew Ng)
-
-
Machine Learning From Data
230625
-
Microcontroller and PCB design for robotics
-
-
Networks and communication services
11522
-
Optical communications
11513
-
Principles of Reactive Programming
-
-
Radio communications
11521
-
Team skills traning and communication (Nokia)
-
Projects
-
mycelium search engine
- Present
See projectAn small web crawler open source information retrieval system written in C++ and Python. The next Google ;) .
-
MXNet
-
See projectLightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more https://mxnet.apache.org
-
Sudoku solver using uninformed search
-
See projectA very fast sudoku solver using uninformed graph search coded in really bad C+
Languages
-
Spanish
Native or bilingual proficiency
-
English
Native or bilingual proficiency
-
German
Limited working proficiency
-
Catalan
Professional working proficiency
Recommendations received
5 people have recommended Pedro
Join now to viewMore activity by Pedro
-
I needed to monitor GPU metrics across an AI cluster. Should be simple, right? Three tools, two plugins, a container runtime, and half a day later…
I needed to monitor GPU metrics across an AI cluster. Should be simple, right? Three tools, two plugins, a container runtime, and half a day later…
Liked by Pedro Larroy
-
Un año más, comprometidos con la ciberseguridad en Andalucía Participamos de nuevo en Málaga como empresa colaboradora en el Congreso de…
Un año más, comprometidos con la ciberseguridad en Andalucía Participamos de nuevo en Málaga como empresa colaboradora en el Congreso de…
Liked by Pedro Larroy
-
This time, I believe it truly is the last time. This is my last week at Meta, and I couldn't be more excited about what's next. I'll keep this…
This time, I believe it truly is the last time. This is my last week at Meta, and I couldn't be more excited about what's next. I'll keep this…
Liked by Pedro Larroy
-
Hagay Lupesko, SVP of AI Inference at Cerebras will be speaking at Xcelerated Compute hosted by DatacenterDynamics “All-In on Inference: The bold…
Hagay Lupesko, SVP of AI Inference at Cerebras will be speaking at Xcelerated Compute hosted by DatacenterDynamics “All-In on Inference: The bold…
Liked by Pedro Larroy
-
what else can a girl ask for, NVIDIA has sent me a DGX Spark for my upcoming birthday 😭💚 I will use it to label datasets and later train models…
what else can a girl ask for, NVIDIA has sent me a DGX Spark for my upcoming birthday 😭💚 I will use it to label datasets and later train models…
Liked by Pedro Larroy
-
Uber announced a massive new partnership with Rivian, investing over one billion dollars in the American automaker to deploy up to 50,000…
Uber announced a massive new partnership with Rivian, investing over one billion dollars in the American automaker to deploy up to 50,000…
Liked by Pedro Larroy
-
Today, we announced a partnership with Uber to help both companies accelerate their autonomous vehicle plans, building towards a scaled…
Today, we announced a partnership with Uber to help both companies accelerate their autonomous vehicle plans, building towards a scaled…
Liked by Pedro Larroy
Other similar profiles
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content