Software Engineer Opportunity at FactSet
Company: FactSet
Position: Software Engineer
Job Description
FactSet is actively seeking a seasoned Software Engineer with comprehensive experience in AWS cloud architecture, infrastructure deployment, and maintenance. This role involves collaboration with other engineers to deploy applications integrated with Machine Learning (ML) models focused on Natural Language Processing (NLP), classification, and Large Language Models (LLMs).
Key Responsibilities
The Software Engineer will be instrumental in developing a strategic roadmap for managing and expanding the existing pipelines and infrastructure supporting ML and AI solutions. Key tasks include:
- Deployment and maintenance of ML models, databases, and applications.
- Support for various AI/ML projects, including entity and topic modeling, semantic tagging/enrichment, information extraction, and integration of LLMs into existing frameworks.
- Creating dashboards and visualizations for financial experts.
- Ingesting and analyzing both structured and unstructured data.
- Developing processes for data collection, quality assessment, and control.
- Stay updated with technological advancements and share knowledge and passion for ML and technology with the team.
- Collaborating with other engineering teams for continuous improvement.
Essential Qualifications
The ideal candidate will possess:
- A BS or MS in Computer Science or a related field.
- 5+ years of experience as a software engineer.
- Proficiency in AWS and cloud-based infrastructure management.
- Experience in ML and NLP.
- A solid background in writing and releasing production-grade code in an enterprise environment.
- Strong analytical skills and a team-oriented mindset.
- Fluency in English and the ability to communicate complex subjects to non-technical stakeholders.
- High proficiency in Python.
- Familiarity with Docker and API development.
- Experience with various AWS services (e.g., SageMaker, S3, Athena, Glue, ECS, EC2).
Desirable Skills
Additional advantageous skills include:
- Experience with RDFs, triple stores, and relation extraction.
- Handling of large data volumes in stream or batch processing environments.
- Usage of MongoDB.
- Knowledge of deep learning libraries (Keras, PyTorch, TensorFlow).
- Experience with big data tools (Pyspark, Hive).
- Information extraction, parsing, and segmentation expertise.
- Understanding of ontologies, taxonomy resolution, and disambiguation.
- Experience with Unsupervised Learning techniques, including Density Estimation, Clustering, and Topic Modelling.
- Familiarity with graph databases (AWS Neptune, Neo4j).
- Experience with machine learning frameworks such as sklearn and ML workflow.
- Knowledge of NLP libraries and text preprocessing tools (nltk, SpaCy).
- Comfort with Large Language Model frameworks such as OpenAI and Llama
- Experience working with unstructured data (e.g., text content, JSON records), including feature engineering.
- A history of working with Agile development practices in a production environment.
If you are passionate about ML, AI, and cloud-based solutions and meet these qualifications, we encourage you to apply for this exciting opportunity at FactSet.
Apply today and take the next step in your career with FactSet!