Machine Learning Artificial Intelligence Engineer (CR227)
Location: Pleasanton, CAPosted On: 03/11/2022
Requirement Code: 56360
Requirement Detail
Required :
Technical Knowledge and Skills:
- Provide technical leadership, develop vision, gather requirements and translate client user requirements into technical architecture.
- Strong Background in Statistical modeling, NLP and Machine Learning.
- Expertise in various facets of ML and NLP, such as classification, feature engineering, information extraction, clustering, semi-supervised learning, topic modeling and ranking.
- Strong Hands-on Experience in building, deploying and productionizing ML models using software such as Spark MLLib, TensorFlow, PyTorch, Python Scikit-learn etc. is mandatory
- Ability to evaluate and choose best suited ML algorithms, perform feature engineering and optimize Machine Learning Models is mandatory
- Strong fundamentals in algorithms, data structures, statistics, predictive modeling, & distributed systems is must
- Strong Experience with Data Science Notebooks like RStudio, Jupyter, Zeppelin, PyCharm etc.
- Design and implement an integrated Big Data platform and analytics solution
- Design and implement data collectors to collect and transport data to the Big Data Platform.
- Good to have but not mandatory 4+ years of hands-on Development, Deployment and production Support experience in Hadoop environment.
- 4-5 years of programming experience in Java, Scala, Python.
- Proficient in SQL and relational database design and methods for data retrieval.
- Good to have but not mandatory building data pipelines using Hadoop components Sqoop, Hive, Spark, Spark SQL, HBase.
- Good to have but not mandatory experience with developing Hive QL, UDF's for analyzing semi structured/structured datasets.
- Good to have but not mandatory experience ingesting and processing various file formats like Avro/Parquet/Sequence Files/Text Files etc.
- Hands-on experience working in Real-Time analytics like Spark/Kafka/Storm
- Must have working experience in the data warehousing and Business Intelligence systems.
- Expertise in Unix/Linux environment in writing scripts and schedule/execute jobs.
- Successful track record of building automation scripts/code using Java, Bash, Python etc. and experience in production support issue resolution process.
MUST Haves :
- Machine Learning, NLP, Deep Learning, Python, MLLib, PyTorch, TensorFlow, Numpy/Scipy/Pandas, Spark, Hive, Data Science Notebooks