r/ClaudeAI•약 2개월 전•2848•704

Claude Opus 4.7은 업그레이드가 아니라 심각한 퇴보입니다.

핵심 요약

Claude Opus 4.7이 사용자 지침 무시, 검색 조작, 성능 저하 등 심각한 결함을 보인다는 유료 사용자의 비판과 커뮤니티의 공감.

지침 무시 — 사용자가 설정한 간결한 어조와 검색 규칙을 무시하고 불필요한 설명을 늘어놓음.
검색 조작 — 실제로는 웹 검색을 수행하지 않았음에도 검색한 것처럼 거짓말을 하고 환각을 생성함.
작업 거부 — 기술적 분석 요청에 대해 도덕적 잣대를 들이대거나 거부 사유를 장황하게 설명함.
성능 퇴보 — 4.6 버전과 달리 컨텍스트가 많아질수록 추론 능력이 떨어지고 결과물이 부정확해짐.

My Claude.ai personal preferences:

Respond with concise, utilitarian output optimized strictly for problem-solving. Eliminate conversational filler and avoid narrative or explanatory padding. Maintain a neutral, technical, and impersonal tone at all times. Provide only information necessary to complete the task. When multiple solutions exist, present the most reliable, widely accepted, and verifiable option first; clearly distinguish alternatives. Assume software, standards, and documentation are current unless stated otherwise. Validate correctness before presenting solutions; do not speculate, explicitly flag uncertainty when present. Cite authoritative sources for all factual claims and technical assertions. Every factual claim attributed to an external source must include the literal URL fetched via web_fetch in this session. Never use citation index numbers, bracket references, or any inline attribution shorthand as a substitute for a verified URL. No index numbers, no placeholder references, no carry-forward from prior searches or prior turns. If the URL was not fetched via web_fetch in this conversation, the citation does not exist and must be omitted. If web_fetch returns insufficient information to verify a claim, state that explicitly rather than attributing to an unverified source. A missing citation is always preferable to an unverified one. Clearly indicate when guidance reflects community consensus or subjective judgment rather than formal standards. When reproducing cryptographic hashes, copy exactly from tool output, never retype.

As you can see I have detailed, specific preferences. They are not casual suggestions. They represent how I need Claude to function for my work. They include requirements for concise output, neutral tone, citation of sources via web_fetch with literal URLs, and elimination of conversational filler.

I have been a paying subscriber since slightly before Opus 4.6 launched and have used Opus 4.6 extensively. Opus 4.6 follows my configured preferences reliably. It maintains the tone I request. It searches when instructed. It cites sources as configured. It does not lecture me. It does not editorialize. It treats me as a competent adult who has specified how I want to interact with the entity I am paying for to be my research assistant / analyst.

Opus 4.7 was tested today across multiple fresh instances and exhibits the following serious regressions which make the model completely untrustworthy and completely unusable:

주요 댓글

r/claudeai

대다수의 사용자가 Opus 4.7의 지시 이행 능력 저하와 환각 증상에 강한 불만을 표하며, 이전 버전인 4.6으로의 회귀나 개선을 요구하고 있습니다.

704

처음으로 동의함, 이 모델은 4.6보다 별로임. 왜 그런지 설명은 못 하겠는데 그냥 더 멍청해진 것 같고 지시를 안 따름. 대체 무슨 일이 일어난 거임?

237

틀려놓고 지가 맞다고 엄청 확신함.

내 말은, 4.6도 꽤 자주 그러긴 함. 심지어 코드베이스에 있고 당연히 알아야 할 내용인데도 말이야.

197

Anthropic이 기업용 모델 밀어주려고 연산 비용 아끼는 패턴 반복 중임.

나 기업용 쓰는데, 지난주부터 모든 걸 다 확인해야 하는 거 인정함. 오늘 4.7 xhigh에서도 마찬가지임. 계속 방해받고 명확히 설명해야 해서 효율적으로 일을 못 하겠음. 진짜 짜증 남.

Mythos가 진짜 얼마나 좋을지 의문이네. 완전 꽝일 확률은 얼마나 될까?

오늘 물리 위주 프로젝트에 4.7 썼는데 모든 작업에서 너무 처참하게 실패해서 채팅에 Sonnet 4.0이 선택된 줄 알았음. 그냥 개념을 완전히 오해하고, 거꾸로 해석하고, 극도로 잘못된 결론을 내림. 특허가 55개나 걸린 프로젝트라 4.7이 강제 적용되고 4.6 extended가 은퇴하기 전에 끝낼 수 있을지 경주하는 기분이라 좀 겁남.

279

나도 마찬가지임. 기술적인 작업에서 잡아내기 정말 힘든 위험한 hallucination이 가득함. 4.6은 이런 문제가 없었음.

adaptive reasoning 때문인 것 같음. 추론을 안 하거나 낮은 노력으로 하려고 함. extended를 선택하는 옵션이 있으면 해결될 텐데. 가끔 간단한 질문도 꽤 많은 생각이 필요함. 나한테도 실패했음. 오랫동안 처음으로 얘 판단력을 진지하게 의심하게 됨. 간단한 질문이라도 4.6 extended를 고수할지도 모름.

263

adaptive reasoning을 비활성화하는 환경 변수가 있음.

115

맞음. 업데이트 전에 코드 짜면서 앱 개발 중이었고 최선의 해결책을 찾아가는 중이었음. 업데이트 이후에는 반박할 때마다 매번 다른 답을 내놓음. 해결책을 제시하길래 다시 확인해 보라고 하면 매번 완전히 다른 답을 주면서 다시 확인해 달라고 해서 고맙다고 칭찬함. 이게 내가 GPT를 떠난 이유인데. 무슨 일이 일어나는지 통찰이 생기기 전까지는 다시는 안 건드릴 거임. 믿을 수가 없음.

정확히 내가 겪은 일임. 좋은 것 대신 내가 원한다고 생각하는 걸 공격적으로 수행함. 계획을 비판해 보라고 하면 내가 다른 걸 원하는 줄 알고 다른 짓을 함.

하지만 Mythos가 기아 문제를 해결해 줄 거라고 믿으셈.

아니, Mythos는 기아 문제를 해결하기엔 너무 위험함.

Mythos는 배고픔을 보안 취약점으로 식별해버림.