About Us: At GE Oil & Gas Digital, we are creating technology and solutions to enable social, mobile, analytical and cloud capabilities for the Industrial Internet. The Industrial Internet is an open, global network that connects people, data and machines. It’s about making infrastructure more intelligent and advancing the industries critical to the world we live in. At GE, we believe it’s about the future of industry—energy, healthcare, transportation, manufacturing. It’s about making the world work better. GE offers a great work environment, professional development, challenging careers, and competitive compensation. GE is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
Role Summary: We are looking for a highly motivated individual, passionate about technology to join the GE Oil & Gas Digital team. As the Staff Data Scientist – Big Data & Discovery, you will focus on developing impactful and innovative big data analytics products for the O&G industry.
You will be responsible for designing analytics for products, leverage strong machine learning expertise to develop new analytics for driving growth in asset, application & industry coverage and lead engagements with external/internal customers. The candidate is also expected to mentor other engineers in analytics methods.
As the Staff Data Scientist, you will:
Work in cross-functional teams to translate big-data algorithms into commercially viable products and services.
Develop self-learning big-data systems that can autocorrect and auto-learn patterns based on multiple data sources
Contribute to technical teams in development, deployment and application of applied analytics, predictive analytics and prescriptive analytics capabilities.
Work with the engineering team to incorporate your analyses and solutions, including working with the visualization team to create intuitive UI and rich UX stories. Partner with data engineers on data quality assessment, data cleansing and data analytics efforts
Gather and analyze large data sources, devise innovative data science solutions and build prototypes to enable development of high-performance algorithms in scalable, product-ready code.
Build semantic and graph models for knowledge extraction, discovery and storage and enable bot-apis
Initiate and propose unique and promising modeling features, develop new and innovative algorithms and technologies, pursuing patents where appropriate
Stay current on published state-of-the-art algorithms and competing technologies.
Contribute to the development of software and data delivery platforms that are service-oriented with reusable components across teams (multiple teams) that can be orchestrated together into different methods for different businesses.
Research and evaluate emerging technology, industry and market trends to assist in project development and/or operational support activities to for multiple teams or complex scenarios.
MS Degree in Computer Science or in “STEM” Majors (Science, Technology, Engineering and Math)
A minimum of 5 years of technical hands-on coding experience
With a minimum of 2yrs as a data scientist.
Legal authorization to work in the U.S. is required. We will not sponsor individuals for employment visas, now or in the future, for this job
Must be willing to work out of an office located in San Ramon, CA
Must be willing to travel roughly 5% of the time
PhD in Computer Science or in “STEM” Majors (Science, Technology, Engineering and Math)
Strong distributed systems and architecture knowledge, and experience with multitier architecture
Mission critical systems experience is preferred
Experience developing applications in an agile/DevOps environment would be a distinct advantage
Hands-on experience in big data, information retrieval, data mining or machine learning (Hadoop, Solr, Hive, HBase, Storm, Spark, Kafka, Yarn ,Storm, Splunk, Vertica).
Understands the concepts and technology ecosystem around both real-time and batch processing in Hadoop (HDFS, MapReduce, Spark, HBase, Hive, Beam)
Broad understanding of modern cloud computing technologies and their tradeoffs
Experience working with NoSQL Databases (HBase, Cassandra, Couchbase, etc.)
Demonstrated ability to develop containerized solutions (Docker/Mesos etc)
Strong implementation experience with high-level languages and frameworks such as R, Python, Perl, Ruby, Scala, Storm, SAS
Experienced in implementing Big Data technologies successfully in an enterprise;
Advanced knowledge in entity and relationship extraction from unstructured data;
Experienced in developing and integrating software allowing for flexible and scalable data transformation with data quality controls.
Contribute to the evolving architecture to meet growth requirements for scaling, reliability, performance and security. Strong hands-on skills in sourcing, cleaning, manipulating and analyzing large volumes of data including SQL and NoSQL databases
Create, analyze and manage projects that provide direct business benefit; demonstrate detailed knowledge of business operations and strategic direction, including merger & acquisition opportunities
Understand industry trends and competitive landscape and the implications for your GE business
Partner with business leaders to align projects with business goals and needs.
Recommends allocation of budget to meet architectural initiatives critical to business/mission success.
Develops the business case for approval.
Facilitates dialogues that produce new perspectives and trigger recommendations for substantial innovative / enhancements, and analysis of consequences.
Challenges conventional thinking and traditional ways of operating and invites stakeholders to identify issues and opportunities.
We are in the process of transitioning to an improved job application system and in the interim we are operating with two systems. Have your Job ID ready (from the email you received when you applied) to log in and check your application status.
Click the appropriate button. If you don't know your job ID, you can still check your status: use both buttons.