Prompt & Image Guide
The quality of your prompt (for Text-to-CAD) or source image (for Image-to-CAD) is the single biggest factor in output quality. This page covers what the pipeline handles well, what language patterns produce better results, and what to avoid.
Text-to-CAD Prompt Guide
Geometry That Works Well
Makistry's pipeline generates parametric CAD using CadQuery, a Python-based B-rep modeling kernel. It excels at:
- Prismatic shapes: boxes, cylinders, extrusions, pockets, shells
- Standard features: chamfers, fillets, drafts, thin-wall shells
- Hole patterns: through-holes, blind holes, counterbores, countersinks, slots
- Threaded features: tapped holes — specify M-size (e.g. M5, M3 × 0.5)
- Assemblies of simple mating parts: enclosures, brackets, clamps, fixtures
- Standard hardware: fasteners, bearings, gears — Makistry has a parts warehouse; name them by size (e.g. "M6 hex bolt, 20mm length", "608 ball bearing", "M3 heat-set insert")
Language Patterns That Produce Better Results
Be specific with dimensions — always include units:
✓ A 50×30×10mm rectangular plate
✗ A small flat plate
Name the feature type explicitly:
✓ An M5 counterbore: 8.5mm head diameter, 4.5mm deep, centered on the top face
✗ A screw hole on top
Specify the reference face or location:
✓ Two M4 through-holes on the long face, 15mm from each end, centered vertically
✗ Two holes on the side
Describe mating intent for assemblies:
✓ The lid sits on a 1.5mm step that runs around the inner perimeter of the base, with 0.15mm clearance
✗ The lid goes on the base
Use geometric vocabulary:
✓ Circular pocket, 20mm diameter, 5mm deep, centered on the top face
✗ A round dent in the top
Three Example Prompts at Different Levels of Complexity
Hobbyist — Lens Cap
A lens cap for a 52mm filter thread: 60mm outer diameter, 52mm inner diameter at the
retention lip (3mm deep), 3mm wall thickness throughout. Snap-fit retention ring on the
inner circumference, 1mm wide × 2mm tall, continuous around the full inner circumference.
Engineering — Mounting Bracket
An L-shaped aluminum mounting bracket. Horizontal leg: 80mm long × 40mm wide × 4mm thick.
Vertical leg: 60mm tall × 40mm wide × 4mm thick. Both legs share a 40mm width. Two M5
through-holes on the horizontal leg: 10mm from each end, centered on the 40mm width.
Triangular gusset on the inner corner: 20mm on each leg face, 4mm thick. All outer corners
chamfered 1mm.
Assembly — Electronics Enclosure
A two-part electronics enclosure. Base: 100×60×35mm external dimensions, 2mm wall thickness,
open top. Four M3 boss inserts at the interior corners: 6mm OD, 3mm ID, 5mm tall, positioned
3mm from each corner. A 2mm stepped lip runs around the full inner perimeter at the top,
1.5mm tall, to accept the lid. Lid: 100×60×12mm external, 2mm walls, closed top. Inner step
around the perimeter: 2mm tall × 2mm deep, sized to seat on the base lip with 0.2mm clearance
on all sides. Four M3 through-holes in the lid corners, aligned with the base inserts.
What's Not Supported
- Organic freeform surfaces: cloth, skin, terrain, splash forms, sculptural shapes
- Fluid or simulation geometry: anything that requires computational simulation to define shape
- Sub-0.1mm features: the kernel can represent them but the pipeline may not reliably produce them
- Assemblies with more than 8–10 distinct parts in
standardorpromode — switch to"assembly"mode, which parallelizes generation across parts - Injection molding draft analysis, wall thickness analysis, FEA — geometry only, no simulation
Image-to-CAD Source Image Guide
What Makes a Good Source Image
The image pipeline extracts edges, interprets geometry, and reconstructs a 3D B-rep model. The quality of that reconstruction depends almost entirely on how clearly the geometry is visible.
Best sources, in order of accuracy:
- Dimensioned engineering drawings — dimensions are read directly; highest accuracy
- Hand sketches with labeled dimensions — nearly as accurate; much faster to produce than a photo
- Orthographic photos (top, front, side) — good accuracy; supply multiple angles for 3D reconstruction
- Perspective product photos — moderate accuracy; geometry is estimated from visible edges
- Renders — similar to photos; useful when the physical part doesn't exist yet
Ideal Image Conditions
- Single part per image (or a clearly dominant part)
- Clean background — white, light grey, or contrasting solid color
- High contrast between part edges and background
- Even lighting — avoid shadows that obscure edges or create false edges
- Sharp focus throughout the image — especially at edges
- Orthographic projection preferred over perspective; angled photos require the model to estimate foreshortening
Using Multiple Images
Submit up to 3 images of the same part from different viewpoints. Pair with a prompt hint to identify each view:
image 1: part_front.png
image 2: part_side.png
image 3: part_top.png
prompt: "front view, right-side view, and top view of the same bracket"
Multi-view submissions significantly improve 3D reconstruction accuracy, particularly for parts with features that are only visible from certain angles.
What Does NOT Work
| Problem | Why it fails | Fix |
|---|---|---|
| Blurry or out-of-focus image | Edge detection degrades; geometry is guessed | Use a sharper photo or a drawing instead |
| Cluttered background | The pipeline can't segment the part from the scene | Isolate on a plain background |
| Assembled product where parts occlude each other | Internal geometry is invisible and will be estimated | Photograph individual parts separately |
| Hands, rulers, packaging in frame | Confuse segmentation and scale estimation | Crop to the part only |
| Organic shapes (cloth, food, living things) | B-rep CAD can't represent these geometries | Not supported |
| Fine PCB or electronic detail | Resolution limits and component density overwhelm geometry extraction | Not supported |
| High-perspective distortion (extreme wide angle) | Geometry estimation breaks down | Use a longer focal length or a drawing |
Resolution is rarely the problem. A sharp 800×600 image of a clean part will outperform a blurry 4K photo. Clarity and contrast matter more than megapixels.
Image vs. Drawing: Behavioral Differences
| Source type | Dimension handling | Typical accuracy |
|---|---|---|
| Dimensioned drawing | Dimensions read directly from annotation | High |
| Sketch with labels | Dimensions read from handwritten labels | High |
| Orthographic photo (no dims) | Proportions estimated from pixel ratios; scale inferred from context | Medium |
| Perspective photo | Foreshortening corrected; scale assumed | Medium–Low |
If you have a dimensioned drawing, always use it over a photo for the same part.