Houssem C.

DATA ENGINEER

1025 dollar
Freelancer
11 ans
Toulouse, FRANCE

Mon expérience

Voir plus

Air France-KLMDecember 2018 - Présent

- Management of HDP Hadoop clusters life cycle
- Architecture study and integration of new Hadoop services and components
- Monitoring data technologies advances
- Documentation of Hadoop cluster management procedures
- Support to the operation team
Voir plus

Grenoble INP - Institut polytechnique de GrenobleOctober 2018 - November 2018

Transfert of Technology from Grenoble INP to Enedis
Documentation of developped tools
Voir plus

Grenoble INP - Institut polytechnique de GrenobleDecember 2014 - June 2018

Enedis Industrial Chaire of Excellency on Smart Grids.

- Management of small group of interns and PhD students
- Participation in the research project setup process
- Design of a hybrid data lake architecture for managing smart grid data: Hadoop Distributed
- RDBMS, NoSQL, Large–scale processing.
- Best practices for better performance in the data ecosystem; Data storage and processing
optimization.
- Global metadata management system in the data lake.
- Benchmarking smart meter data management and processing at scale to evaluate various
large scale data management systems and approaches.

Technologies include CDH5 (Hadoop), Spark (SQL, Streaming, MLlib), Postgres-XL,
Apache Drill, MongoDB, Cassandra, Java Spring, Apache Atlas, Apache Kafka, ...
Voir plus

Beepeers / INRIAMarch 2014 - November 2014

-Migration of Beepeers social platform to Java Spring to benefit from its lightweight beans and spring data projects
-Design of Polyglot persistence distributed social platform for Beepeers.
-Technologies include Java Spring; Spring Data; MySQL; Neo4j graph database; Ehcache; Jersey Rest; MySQL
Voir plus

InriaSeptember 2010 - December 2013

-Research on data consistency management in distributed data systems and its tradeoffs in terms of Performance, Cost, and Energy consumption.
-Experiments on multi-site geographically distributed deployments of Apache Cassandra on Grid’5000 and on Amazon EC2.
-Energy consumption evaluation within Hadoop deployments
Voir plus

INRIASeptember 2010 - December 2013

Research on data consistency management in distributed data systems and its tradeoffs in terms of Performance, Cost, and Energy consumption.
-Experiments on multi-site geographically distributed deployments of Apache
Cassandra on Grid'5000 and on Amazon EC2.
-Energy consumption evaluation within Hadoop deployments

Mes compétences

Unix Shell Scripting, SQL, Spark, Python, PostgreSQL, NoSQL, Neo4J, MySQL, MongoDB, Machine Learning, Linux, LaTeX, Java, Hadoop, Google Cloud Platform (GCP), EhCache, Distributed Systems, Cloud Computing, Cassandra, C/C++, Big Data, AWS, Apache Kafka, Amazon EC2