acquiring, processing, and refining large or complex data sets from various sources utilizing computer programming where necessary.
Analyze, evaluate, and assess quantitative data using statistical software (computer models, geospatial models, software languages, mathematical models) to contribute to or develop software tools, analytic models, or report.
Anticipate and project a wide range of possible outcomes using scenario/alternative analyses, machine learning or other advanced analytic techniques.
Demonstrated experience developing new data workflow and approaches using Pentaho and custom scripting to handle high volume, heterogeneous data used to identify patterns, correlations, trends, and relationships.
Experience in data science methodology to integrate systems engineering and data science techniques.
Demonstrated experience with data architecture, data modeling, database design, and data systems implementation especially with Oracle-based technologies such as MySQL, NoSQL technologies like ElasticSearch, and distributed systems like Hadoop, or HPCC.
Demonstrated experience in data analysis of structured and unstructured data, including financial, web, event and travel data.
Demonstrated experience with two or more: XML, JSON, SQlLte databases, PCAP.
Demonstrated experience in developing customer data visualizations using COTS products like Tableau or Kiabana.
Tasks and Responsibilities:
Determine and emply the most appropriate research design for data collection and analysis
Acquire, process, and refine large and complex data sets from various sources; utilizing computer programming where necesary
Analyze, evaluate and assess quantitative data to contribute to or develop software tools, analytic models, or reports
Conduct statistical, mathematical, geospatial modeling or data-mining analysis in partnership with other Data Scientists or Analysts
Anticipate or project possible outcomes using scenario/alternative analysis, machine learning, and other advanced analytic techniques.
Identify, use and/or develop a wide range of methodologies and analytic tools to address existing or potential problens and strategies
Demonstrated knowledge working independently to formulate and execute project plans, leading teams of data engineer and analysts, providing training on the use of new tools, technologies and data sets
Demonstrated experience manipulating and deriving actionable information from large data sets
Demonstrated knowledge with development in a Unix/Linux environment, bash scripting, deploying code, using servers and virtual machines
Experience with bulk analysis of systems log data, COTS, and custom-developed Natual Language Processing, text analytics, content analysis tools, algorithms as well as devloping or implementing machine learning, statistical analysis, and NLP techniques