← Back to tools
View AGENTS.md for ssam
ssam
Split text files into training, test, and development sets using random sampling
Description
Ssam (split sampler) splits one or more text-based input files into multiple sets using random sampling. This is useful for splitting data into training, test, and development sets for machine learning.
AI Summary
Split text files into training, test, and development sets using random sampling
Capabilities
- + Random sampling of text files
- + Split into multiple data sets
- + Training/test/dev set creation
- + Process multiple input files
Use When
- → When you need to split data for machine learning
- → When creating training and test datasets
Avoid When
- x When you need stratified sampling